Gene coding for erbin, and diagnostic and therapeutic uses thereof

ABSTRACT

The invention concerns a novel gene, coding for a protein named Erbin (Erbb2 Interacting protein), which interacts with the intracellular part of ERBB2/HER-2 receptors, involved in particular in the development of cancers, the diagnostic and therapeutic uses of novel nucleotide sequences and identified amino acids.

[0001] The present invention relates to a novel gene which codes for a protein designated as Erbin which interacts with the receptor ERBB2/HER2.

[0002] ERBB2/HER2 is a receptor with tyrosine kinase activity belonging to the family of EGF receptors implicated in carcinomas (epithelial cancers) affecting various human organs such as the liver, the kidney and the breast for example (Ross et al, 1999; Klapper et al, 2000). This implication is directly linked to the enzymatic activity of the receptor (called tyrosine kinase) and its intracellular signalling, as numerous studies in vitro have shown. The gene erbb2 is amplified in approximately 30% of cancers of the breast and constitutes a factor of poor prognosis in terms of the survival of the patient. The fact that the receptor is overexpressed on the tumour cells and that it has an exacerbated enzymatic activity arouses a keen interest in human therapeutics. Two approaches are currently employed to target this receptor and to diminish its tumorigenic effect in man.

[0003] The first consists of blocking the excess of ERBB2/HER2 and therefore the growth of the tumours by means of a monoclonal antibody called Herceptin® or Trastuzumab® (marketed by GENENTECH) used in breast cancers. However, this medication was only effective in half of the women who were tested. The second approach is the use of inhibitors of the enzymatic activity of the receptor. No medication utilising this approach has currently been put on the market. There is therefore a need for other means capable of modulating the activity of the receptor and/or its expression in the tumours.

[0004] The authors of the present invention have managed to identify with precision a new gene which codes for a protein which fixes to the intracellular part of the receptor, constitutes a specific new adapter of ERBB2 and is implicated in its sub-cellular localisation. This new protein has been called Erbin, for “Erbb2 interacting protein”.

[0005] Annexed hereto is a list of sequences in which the sequence SEQ ID no. 1 represents the cDNA of the Erbin gene in man corresponding to the open reading frame (ORF). This ORF (nucleotides 324 to 4436 on SEQ ID no. 1) codes for a protein of 1371 amino acids of which the sequence is presented in SEQ ID no. 2.

[0006] Three regions are defined in the protein:

[0007] amino-terminal region (residues 1 to 501) containing LRR motifs,

[0008] carboxy-terminal region (residues 1279 to 1371) comprising the PDZ domain;

[0009] central region (residues 502 to 1278). This is a specific region of Erbin, not retained in other proteins, the function of which is to fix other proteins, with enzymatic activity (phosphatase, kinase . . . ) or without enzymatic activity, implicated in the functioning of Erbin or its localisation. This protein fragment or the polypeptides comprising this fragment also form part of the invention. The association may be constitutive or modulated by a post-traductional modification of Erbin (phosphorylation . . . ). With regard to the primary sequence of this region, the domains SH3 or WW of cytosolic proteins may be fixed on the residues 971 to 977, the residues 1100 to 1105, the residues 1115 to 1121. The protein fragments or the peptides or polypeptides comprising these fragments also form part of the invention.

[0010] The annexed list of sequences also comprises a fragment of cDNA of the Erbin gene in the mouse (SEQ ID no. 3) of which the coding part (nucleotides 1 to 1485 of SEQ ID no. 3) codes for an incomplete protein of 495 amino acids, represented by SEQ ID no. 4.

[0011] This mouse peptide sequence is homologous to the peptide sequence of human Erbin (more than 80% identity at the protein or nucleotide level, between the residue 914 and the residue 1371 of the human sequence SEQ ID no. 2), with the exception of the peptide between the residues 296 and 334 of SEQ ID no. 4 (corresponding to the nucleotides 889 to 1006 on SEQ ID no. 3) which is specific to the mouse sequence. These specific murine fragments also form part of the invention.

[0012] The sequence SEQ ID no. 5 comprises a partial genomic sequence of mouse Erbin. It contains the first exon of Erbin (nucleotides 2347 to 2535 on SEQ ID no. 5, which have a 94% identity with the human sequence and code for the residues 1 to 63 of mouse Erbin on SEQ ID no. 4, 100% identical to the human sequence). The isolated nucleic acid comprising a sequence comprising the nucleotides 2347 to 2535 on SEQ ID no. 5 also forms part of the invention.

[0013] The sequence SEQ ID no. 6 is a nucleotide sequence which is found inserted between the nucleotides 3956 and 3957 of human Erbin (SEQ ID no. 1), retaining its open reading frame, in a variant.

[0014] This sequence SEQ ID no. 6 codes for the peptide SEQ ID no. 7, which is inserted between the amino acids 1211 and 1212 of SEQ ID no. 2.

[0015] The sequence SEQ ID no. 8 is a probe in 5′ of the human sequence of Erbin (nucleotides 74 to 750 of SEQ ID no. 1), whilst the sequence SEQ ID no. 9 is a probe in 3′ (nucleotides 4240 to 4600 of SEQ ID no. 1).

[0016] The sequences SEQ ID no. 10 to SEQ ID no. 20 are useful triggers for amplifying the cDNA of human Erbin by PCR, according to the following table: correspondence of nucleotides on SEQ ID no. SEQ ID no. 1 10 136 to 166 11 174 to 205 12 327 to 350 13 2788 to 2809 14 2812 to 2840 15 2892 to 2922 16 2993 to 3036 17 3199 to 3230 18 3339 to 3371 19 4458 to 4486 20 4490 to 4518

[0017] The sequence SEQ ID no. 21 represents the last nine carboxy-terminal amino acids of ERBB2.

[0018] The object of the present invention therefore is an isolated nucleic acid comprising the sequence SEQ ID no. 1, no. 3 or no. 5. It is understood that this also includes the homologous sequences defined as:

[0019] i) sequences similar to at least 70% of the sequence SEQ ID no. 1, no. 3 or no. 5; or

[0020] ii) sequences hybridising with the sequence SEQ ID no. 1, no. 3 or no. 5 or its complementary sequence under stringent hybridisation conditions, or

[0021] iii) sequences coding for the polypeptide called Erbin, as defined previously.

[0022] Preferably a homologous nucleotide sequence according to the invention is similar to at least 75% of the sequences SEQ ID no. 1, no. 3 or no. 5, more preferably to at least 85%, or at least 90%.

[0023] In a preferred manner, such a homologous nucleotide sequence hybridises specifically to complementary sequences of the sequence SEQ ID no. 1, no. 3 or no. 5 under stringent conditions. The parameters define the conditions of stringency depending upon the temperature at which 50% of the paired strands separate (Tm).

[0024] For sequences comprising more than 30 bases, Tm is defined by the relationship: Tm=81.5+0.41(%G+C)+16.6Log(concentration of cations)−0.63(%formamide)−(600/number of bases) (Sambrook et al, 1989).

[0025] For sequences of a length less than 30 bases, Tm is defined by the relationship: Tm=4(G+C)+2(A+T).

[0026] In appropriate conditions of stringency at which the aspecific sequences do not hybridise, the hybridisation temperature can preferably be from 5 to 10° C. below Tm, and the hybridisation buffers used are preferably solutions of high ionic force such as a solution 6×SSC for example.

[0027] The term “similar sequences” employed above refers to the perfect resemblance or identity between the nucleotides compared but also to the non-perfect resemblance which is qualified as similarity. This search for similarities in the nucleic sequences distinguishes for example the purines and pyrimidines.

[0028] Therefore a nucleotide sequence homologous to the ORF represented in SEQ ID no. 1, no.3 or no. 5 includes any nucleotide sequence which differs from the sequence SEQ ID) no. 1, no. 3 or no. 5 by mutation, insertion, deletion or substitution of one or several bases, or by the degeneration of the genetic code, in so far as it codes for a polypeptide having the biological activity of Erbin as defined below.

[0029] Amongst such homologous sequences are included the sequences of genes of mammals other than man, coding for Erbin, preferably of a primate, bovine, ovine or porcine, or even a rodent as well as the allelic variants.

[0030] The present invention also relates to an isolated polypeptide, called Erbin, comprising the amino acid sequence SEQ ID no. 2 or no. 4. It is understood that this also includes the homologous sequences defined as:

[0031] i) sequences similar to at least 70% of the sequence SEQ ID no. 2 or no. 4; or

[0032] ii) sequences coded by a homologous nucleic acid sequence as defined previously, that is to say a nucleic acid sequence hybridising with the sequence SEQ ID no. 2 or no. 4 or its complementary sequence under stringent hybridisation conditions.

[0033] Here too, the term “similar sequences” employed above refers to the perfect resemblance or identity between the amino acids compared but also to the non-perfect resemblance which is qualified as similarity. This search for similarities in a polypeptide sequence takes into account the conservative substitutions which are substitutions of amino acids of the same class, such as substitutions of amino acids with non-charged side chains (such as asparagine, glutamine, serine, threonine and tyrosine), amino acids with basic side chains (such as lysine, arginine and hisstidine), amino acids with acid side chains (such as aspartic acid glutamic acid); amino acids with apolar side chains (such as glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan and cysteine).

[0034] More generally, therefore, “homologous sequence of amino acids” is understood to mean any sequence of amino acids which differs from the sequence SEQ ID no. 2 or no. 4 by substitution, deletion and/or insertion of an amino acid or of a reduced number of amino acid, particularly by substitution of natural amino acids by non-natural amino acids or amino pseudoacids at positions such that these modifications do not significantly undermine the biological activity of Erbin.

[0035] Preferably such a homologous sequence of amino acids is similar to at least 85% of the sequence SEQ ID no. 2 or no. 4, preferably at least 95%.

[0036] The homology is generally determined using sequence analysis software (for example Sequence Analysis Software Package of the Genetics Computer Group, University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, Wis. 53705). Similar amino acid sequences are aligned in order to obtain the maximum degree of homology (i.e. identity or similarity, as defined above). For this purpose it may be necessary to introduce gaps into the sequence artificially. Once the optimum alignment is achieved, the degree of homology is established by recording all the positions for which the amino acids of the two sequences compared are identical, relative to the total number of positions.

[0037] The “biological activity of Erbin” refers to its capacity for fixing on the intracellular part of the ERBB2/HER2 receptor and to its capacity to target the receptor towards the basolateral epithelium, where it can participate in the transduction of signals.

[0038] The invention also relates to variants of Erbin, such as the following two variants:

[0039] variant designated “1302”, comprising the sequence SEQ ID no. 1 with a deletion of the nucleotides 3957 to 4163, retaining the open reading frame. The protein coded by this variant has as sequence the sequence SEQ ID no. 2 deleted from residues 1212 to 1280. The isolated polypeptide comprising the sequence of amino acids 1212 to 1280 of SEQ ID no. 2 also forms part of the invention.

[0040] variant designated “1419”, comprising the sequence SEQ ID no. 1 with an insertion between the nucleotides 3956 and 3957 of this sequence of the fragment SEQ ID no. 6 which codes for the peptide SEQ ID no. 7.

[0041] All of the references to the nucleotide sequence SEQ ID no. 1 or to the sequence of amino acids SEQ ID no. 2 in the present description extend to these variants.

[0042] The invention also relates to the peptide of sequence SEQ ID no. 7 and the nucleic acid which codes for this peptide, such as the nucleic acid of sequence SEQ ID no.6.

[0043] The polypeptide of the present invention can be synthesised by all the methods well known to the person skilled in the art. The polypeptide according to the invention can for example be synthesised by the techniques of synthesis chemistry, such as the Merryfield type of synthesis which is advantageous for reasons of purity, antigenic specificity, absence of unwanted secondary products and its ease of production.

[0044] A recombinant protein can also be produced by a process in which a vector containing a nucleic acid comprising the sequence SEQ ID no. 1 or no. 3 or a homologous sequence is transferred into a host cell which is cultured in conditions permitting the expression of the corresponding polypeptide.

[0045] The protein produced can then be recovered and purified.

[0046] The purification processes used are known to the person skilled in the art. The recombinant polypeptide obtained can be purified from lysates and cellular extracts, from the supernatant of the culture medium, by methods used individually or in combination, such as fractionation, methods of chromatography, techniques of immunoaffinity with the aid of specific monoclonal or polyclonal antibodies, etc.

[0047] The nucleic acid sequence of interest here which codes for Erbin can be inserted into an expression vector in which it is bonded in an operating manner to elements permitting the regulation of its expression, such as in particular promoters, activators and/or terminators of transcription.

[0048] The signals controlling the expression of the nucleotide sequences (promoters, activators, termination sequences . . . ) are chosen as a function of the cellular host used. For this purpose the nucleotide sequences according to the invention can be inserted into vectors with autonomous replication within the chosen host or vectors integrating with the chosen host. Such vectors will be prepared according to the methods currently used by the person skilled in the art, and the clones resulting therefrom can be introduced into an appropriate host by standard methods, such as for example electroporation or precipitation with calcium phosphate.

[0049] The cloning and/or expression vectors such as are described above, containing a nucleotide sequence defined according to the invention, also form part of the present invention.

[0050] The invention also relates to host cells transfected, in a transitory or stable manner, by these expression vectors. These cells can be obtained by the introduction into the host cells, prokaryotes or eukaryotes, of a nucleotide sequence inserted into a vector such as is defined above, then the culturing of the said cells in conditions permitting the replication and/or the expression of the transfected nucleotide sequence.

[0051] Examples of host cells include in particular cells of mammals, such as the cells COS-7, 293, MDCK, cells of insects such as the cells SF9, of bacteria such as E. coli and of strains of yeasts such as L40 and Y90.

[0052] The nucleotide sequences of the invention may or may not be of artificial origin. They may be DNA or RNA sequences, obtained by screening of banks of sequences by means of probes developed on the basis of the sequence SEQ ID no. 1, no. 3, no. 5 or 6. Such banks can be prepared by conventional techniques of molecular biology which are known to the person skilled in the art.

[0053] The nucleotide sequences according to the invention can also be prepared by chemical synthesis, or even by mixed methods including the chemical or enzymatic modification of sequences obtained by screening of the banks.

[0054] The nucleotide sequence of the invention permits the production of probes or primers which hybridise specifically with the sequence SEQ ID no. 1, no. 3, no. 5 or 6 according to the invention, or its complementary strand. The appropriate hybridisation conditions correspond to the conditions of temperature and of ionic force usually used by the person skilled in the art, preferably under stringent conditions such as have been defined previously. These probes can be used as a diagnostic tool in vitro for the detection, by hybridisation experiments, notably by hybridisation “in situ”, of specific transcripts of the polypeptide of the invention in biological specimens or for demonstrating aberrant syntheses or genetic anomalies resulting from a polymorphism, mutations or poor splicing.

[0055] The nucleic acids of the invention which are useful as probes have a minimum of 15 nucleotides, preferably at least 20 nucleotides, more preferably at least 100 nucleotides.

[0056] The nucleic acids which are useful as primers have a minimum of 15 nucleotides, preferably at least 18 nucleotides, more preferably at least 40 nucleotides.

[0057] More precisely, the present invention relates to a nucleic acid having at least 15 nucleotides, which hybridises specifically with one of the nucleic acid sequences SEQ ID no. 1, or its complementary one, in stringent hybridisation conditions.

[0058] The nucleic acids consisting of the sequences SEQ ID no. 8 or SEQ ID no. SEQ ID no.9 can be used for example as probe.

[0059] It is also possible to use as probe the fragments of nucleic acid from the nucleotides 1 to 750 of SEQ ID no. 1 and the fragments of nucleic acid from the nucleotides 4240 to 6412 of SEQ ID no. 1.

[0060] It is also possible to use as a primer, for amplification (for example by PCR), the nucleic acids consisting of the sequences SEQ ID no. 10 to SEQ ID no. 20.

[0061] The probes or primers according to the invention are preferably labeled prior to their use. For this, various techniques are within the scope of the person skilled in the art, such as for example fluorescent, radioactive, chemicoluminescent or enzymatic labeling.

[0062] The methods of diagnosis in vitro in which these oligonucleotides are employed for the detection of mutations or of genomic modifications, at the level of the Erbin gene, are included in the present invention.

[0063] The person skilled in the art knows well the standard methods for analysing the DNA contained in a biological specimen and for diagnosing a genetic disorder. Numerous strategies for genotype analysis are available (Antonarakis et al, 1989; Cooper et al, 1991).

[0064] The DGGE method (denaturing gradient gel electrophoresis), the SSCP method (single strand conformation polymorphism) or the DHPLC method (denaturing high performance liquid chromatography; Kuklin et al, 1997; Huber et al, 1995) may preferably be used for detecting an anomaly in the Erbin gene. Such methods are preferably followed by direct sequencing. The RT-PCR process can be advantageously employed for detecting anomalies in the transcript of the Erbin, since it makes it possible to display the consequences of a splicing mutation causing the loss of one or several exons at the level of the transcript, or an aberrant splicing due to the activation of a cryptic site. This process is also preferably followed by direct sequencing. The most recently developed processes using DNA chips can also be employed for detecting an anomaly in the Erbin gene (Bellis et al, 1997).

[0065] The cloning of the Erbin gene as well as the identification of various mutations responsible for the development of tumours make it possible to envisage direct diagnosis. The present invention therefore relates to the use of at least one nucleic acid such as has been defined previously for the detection of an anomaly in the Erbin gene, defined as comprising a nucleic acid sequence SEQ ID no. 1, no. 3, no. 5 or no. 6 in its transcript.

[0066] Consequently the invention relates to a process for diagnosing in vitro a tumour or a predisposition to develop a tumour, comprising the steps consisting of:

[0067] putting a biological specimen containing DNA or RNA into the presence of specific oligonucleotides permitting amplification of all or part of the Erbin gene or of its transcript, defined as comprising a sequence SEQ ID no. 1, no. 3, no. 5 or no. 6;

[0068] amplifying said DNA or RNA;

[0069] detecting the products of amplification;

[0070] comparing the products of amplification which are obtained with those obtained with a control specimen, and in this way detecting a possible anomaly in the Erbin gene or in its transcript, indicating a predisposition to develop a tumour.

[0071] The biological specimen in question can in particular be blood, or a fragment of tissue taken from a patient.

[0072] Since the expression of Erbin is found on cancerous cells, the methods of evaluating the expression of this protein from suspect tissue samples taken from patients are also particularly advantageous in diagnostic tests in anatomopathology.

[0073] These methods can aim to detect the mRNA coding for Erbin or the protein Erbin itself.

[0074] According to the first embodiment, the invention relates to a method of diagnosing in vitro a tumour or a predisposition to develop a tumour, comprising the steps consisting of:

[0075] putting a biological specimen containing mRNA, obtained by taking a sample of suspect cells from a patient, into the presence of specific oligonucleotides permitting amplification of all or part of the Erbin gene or of its transcript, defined as comprising a sequence SEQ ID no. 1, no. 3, no. 5 or no. 6;

[0076] amplifying said transcript;

[0077] detecting and quantifying the products of amplification;

[0078] a modification in the amount of Erbin transcript with reference to the normal control indicating a tumour or a predisposition to develop a tumour.

[0079] According to a second embodiment, the invention relates to an in vitro method for the detection or measurement of the rate of expression of Erbin in a biological sample, comprising contacting at least one antibody directed against Erbin with said biological sample in conditions permitting the possible formation of specific immunological complexes between Erbin and said antibody or antibodies and the detection of the specific immunological complexes which may be formed.

[0080] The invention relates more particularly to the implementation of the above method for the in vitro diagnosis of a tumour or of a predisposition to develop a tumour on a biological specimen obtained from a sample of suspect cells taken from a patient.

[0081] The presence of the Erbin protein or of the transcript of the gene in quantities which are different from normal may be correlated with a more or less serious prognosis, in terms of the aggressiveness of the tumour for example.

[0082] The invention also relates to antibodies directed against the polypeptide Erbin such as defined previously, or against the specific fragments thereof, such as the fragment defined between the residues 502 to 1278 of the sequence SEQ ID no. 2, or advantageously against the fragment defined between the residues 914 and 1371 of the sequence SEQ ID no. 2.

[0083] These may be polyclonal or monoclonal antibodies or fragments thereof, chimeric antibodies, particularly humanised or immunoconjugated ones.

[0084] The polyclonal antibodies can be obtained from the serum of an animal immunised against a polypeptide according to the usual operating methods.

[0085] According to one embodiment of the invention, it is possible to use as antigen an appropriate peptide fragment, such as has been defined above, capable of being coupled through the intermediary of a reactive residue to a protein or another peptide. Rabbits are immunised with the equivalent of 1 mg of the peptide antigen according to the process described by Benoit et al (1982). At four-weekly intervals the animals are treated by injections of 200 μg of antigen and bled 10 to 14 days later. After the third injection, the antiserum is examined to determine its capacity to bond to the antigenic peptide radio-marked with iodine, prepared by the chloramine-T method and is then purified by carboxymethyl cellulose (CMC) ion exchange column chromatography. The antibody molecules are then collected in the mammals and isolated until the desired concentration is reached using methods well known to the person skilled in the art, for example by using DEAE Sephadex to obtain the IgG fraction.

[0086] In order to increase the specificity of the polyclonal serum, the antibodies can be purified by immunoaffinity chromatography using immunising polypeptides on solid phase. The antibody is contacted with the immunising polypeptide on solid phase for a sufficient duration so as to cause the polypeptide to immunoreact with the antibody molecule in order to form an immunological complex in the solid phase.

[0087] The monoclonal antibodies can be obtained according to the conventional method of hybridome culture described by Köhler and Milstein (1975).

[0088] The antibodies or antibody fragments according to the invention can be for example chimeric antibodies, humanised antibodies, fragments Fab and F(ab′)2. They may also be presented in the form of immunoconjugates or of labeled antibodies.

[0089] The antibodies according to the invention, in particular the monoclonal antibodies, can in particular be used for the analysis by immunohistochemistry of sections of specific tissues, for example by immunofluorescence, gold labeling, immunoperoxidase . . . .

[0090] The antibodies thus produced can be advantageously employed in any situation where the expression of Erbin must be observed.

[0091] The invention relates more generally to the use of at least one antibody thus produced for the detection or the purification of a polypeptide such as has been defined previously in a biological specimen.

[0092] The invention also relates to a kit for carrying out this method, comprising:

[0093] at least one antibody specific for Erbin, possibly fixed on a support;

[0094] means for revealing the formation of specific antigen/antibody complexes between Erbin and said antibody and/or means for quantification of these complexes.

[0095] The invention also relates to a method of obtaining a transgenic non-human animal, preferably a mouse, in which the Erbin gene is invalidated. The animal which can be obtained by this process also forms part of the invention. Such an animal, designated as a “knock-out” for this gene, is a useful model for studying the susceptibility to develop tumours candidates to suppress tumours, or on the other hand to evaluate the cytotoxicity of certain molecules.

[0096] The polypeptide according to the invention or the nucleic acid coding for this polypeptide are useful by way of a medicament, particularly for the treatment of tumours in which for example the expression of a modified Erbin is observed, and for the treatment of those in which it is sought to restore the activity of the wild-type Erbin. These may be any type of tumour, including carcinomas but also the tumours affecting the brain.

[0097] The invention also relates to a pharmaceutical composition comprising a polypeptide such as has been defined previously or a nucleic acid coding for said polypeptide, in association with a pharmaceutically acceptable vehicle.

[0098] The modes of administration, the posologies and the galenic forms of the pharmaceutical compositions according to the invention, containing at least one polypeptide, can be determined in the usual manner by the person skilled in the art, particularly according to the criteria generally taken into account for the establishment of a therapeutic treatment adapted to a patient, such as for example the age or the body weight of the patient, the gravity of the patient's general condition, the tolerance of the treatment and the secondary effects observed, etc.

[0099] As a general rule, a therapeutically or prophylatically effective quantity varying from approximately 0.1 μg to approximately 1 mg can be administered to human adults.

[0100] The invention also relates to a pharmaceutical composition comprising a nucleic acid such as has been defined previously which codes for Erbin and a pharmaceutically acceptable vehicle, said composition being intended to be used in gene therapy. The nucleic acid preferably inserted into a vector, generally a viral vector (such as adenoviruses or retroviruses), can be administered in naked form, free of any vehicle favouring the transfer to the target cell, such as anionic liposomes, cationic lipids, microparticles, for example gold microparticles, precipitating agents, for example calcium phosphate, or any other agent facilitating the transfection. In this case the polynucleotide can be simply diluted in a physiologically acceptable solution, such as a sterile solution or a sterile buffer solution, in the presence of or in the absence of a vehicle.

[0101] Alternatively, a nucleic acid according to the invention can be associated with agents which facilitate the transfection. It may be, inter alia, (i) associated with a chemical agent which modifies the cell permeability such as bupivacaine; (ii) encapsulated in liposomes, possibly in the presence of supplementary substances facilitating the transfection; or (iii) associated with cationic lipids or microparticles of silica, or gold or of tungsten.

[0102] When the nucleic acid structures of the invention cover microparticles, these latter can be injected by the intradermal or intraepidermal route using the “gene gun” technique (WO 94/24263).

[0103] The quantity to be used as a medicament depends particularly upon the nucleic acid structure itself, the individual to whom this nucleic acid is administered, the mode of administration and the type of formulation, and the pathology. As a general rule, a therapeutically of prophylactically effective quantity varying from approximately 0.1 μg to approximately 1 mg, preferably from approximately 1 μg to approximately 800 μg, more preferably from approximately 25 μg to approximately 250 μg, can be administered to human adults.

[0104] The nucleic acid structures of the invention can be administered by any conventional administration route such as in particular parenterally. The choice of administration route depends in particular upon the formulation chosen. Administration targeted at the site of the tumours concerned can be particularly advantageous.

[0105] Finally, the invention relates to a therapeutic treatment method in which an effective quantity of an Erbin polypeptide such as has been defined previously or a nucleic acid coding for this polypeptide is administered within the framework of gene therapy to a patient who needs such a treatment.

[0106] The patient concerned is generally a human being, but the application can also be extended to any mammal if appropriate.

[0107] Another aspect of the invention relates to cancerous tumours in which the ERBB2/HER2 receptor is implicated, an overexpression of this receptor often being observed in these cases. Amongst these tumours may be cited more particularly the carcinomas (epithelial cancers) which affect for example the liver, the kidney, the breast or the colon.

[0108] One strategy can then be to inhibit the interaction between Erbin and the receptor in order to delocalise it.

[0109] Another strategy consists of blocking the expression of the native Erbin, for example by means of antisense oligonucleotides preventing the transcription of the naked RNA of the Erbin gene.

[0110] Therefore the invention also relates to a method of screening such molecules inhibiting the binding of Erbin and ERBB2 receptor, this method comprising contacting a molecule to be tested with

[0111] (i) a first binding partner which is the ERBB2/HER2 receptor or a fragment thereof which fixes to Erbin, and

[0112] (ii) a second binding partner which is Erbin or a fragment thereof which fixes to the receptor,

[0113] said first and/or second binding partner(s) being labeled in such a way as to be detected,

[0114] and evaluating the inhibition of the interaction between the said binding partners by the molecule to be tested.

[0115] In order to carry out these competitive binding tests, for example of the ELISA or IRMA types, a fragment of the ERBB2/HER2 receptor which fixes to Erbin can therefore be used. This fragment comprises at least the nine last C-terminal amino acids of the receptor, namely the sequence EYLGLDVPV represented by SEQ ID no. 21. On the other hand, it is possible to use recombinant Erbin or a fragment thereof, which fixes to the receptor, namely a fragment which comprises the PDZ domain of Erbin.

[0116] One or the other of these binding partners is labeled in a detectable manner, the means for labeling the proteins being well known to the person skilled in the art. This could be a fluorescent labeling agent which binds chemically to the proteins without denaturation in order to form a staining fluorochrome which is a useful immunofluorescent indicator. The labeling agent can also be an enzyme such as peroxidase (HRP) or glucose oxidase. Radioactive elements can also be used as labeling agents.

[0117] In a preferred manner a protein is used (preferably Erbin) which is fused or coupled to a protein of the type of biotin, glutation-S-transferase, polyhistidine . . . .

[0118] One or the other of the binding partners (preferably the receptor) can be immobilised in an advantageous manner on a solid phase (membrane, beads, plastic . . . ).

[0119] The interaction between the binding partners is therefore demonstrated by an enzymatic test, fluorescence, or by autoradiography, . . . .

[0120] The molecules having inhibiting activity which are selected by this method can then be subjected to a test of precipitation of the ERBB2/HER2 receptor produced in the cells of mammals. The receptor is precipitated from a cellular lysate by virtue of the recombinant protein comprising the PDZ domain of Erbin and revealed by Western Blot or by another sufficiently sensitive method.

[0121] The molecules in which the inhibiting activity of the Erbin-receptor interaction is confirmed are retained to be tested on polarised epithelial cells in order to evaluate their capacity to induce delocalisation of the receptor.

[0122] Alternatively, a triple-hybrid test in yeast can be carried out for this purpose (Zhang J. et al, 1996).

[0123] The molecules thus screened represent excellent candidates in a strategy of anti-cancer therapy consisting of inhibiting the interaction between Erbin and ERBB2 in the tumour cells in order to delocalise the receptor and to render it inactive.

[0124] Conversely, in the case where an individual expresses an altered Erbin protein or a mutated ERBB2 receptor such that the Erbin-receptor interaction is diminished relative to a normal control, it may be advantageous to seek activators of this interaction.

[0125] To this end a method is proposed for screening of molecules activating the binding of Erbin and ERBB2 receptor which can be carried out easily on the model of the method for screening of inhibitors described above or by the triple-hybrid technique.

[0126] The invention also relates to a process for identification of mutants of Erbin which increase or reduce the interaction between this protein and ERBB2, in which a mutation is effected on the Erbin and the effect of this mutation on the interaction of the latter with the ERBB2 receptor is evaluated.

[0127] The mutagenesis can be targeted or achieved by chance according to techniques known to the person skilled in the art. The evaluation of the effect of the mutation can be achieved by a test of precipitation of ERBB2 or a double-hybrid test, or even by means of an expression bank.

[0128] In this latter technique, the mutated PDZ domain is expressed by means of phages or bacteria spread over culture boxes. After transfer of the proteins produced onto a membrane of the nitrocellulose type, the peptide of ERBB2 labeled by radioactivity, biotinylation or some other method is incubated with the membrane. The mutants which abolish the interaction and no longer fix the peptide are taken and the nucleotide sequence is analysed in order to know the mutation introduced.

[0129] The invention also relates to the mutants of Erbin which increase or decrease the interaction between the PDZ domain of this protein and ERBB2.

[0130] The mutation of histidine (position 1347) into leucine (mutant HL) or the mutation of the residues leucine (position 1291) into methionine and phenylalanine (position 1293) into isoleucine (mutant MI) cause in particular an increase in the interaction between the PDZ domain of Erbin and ERBB2.

[0131] The mutation of histidine (position 1347) into tyrosine and of glycine (position 1348) into aspartic acid (mutant YD) or the mutation of threonine (position 1316) into asparagine and of arginine (position 1317) into serine (mutant NS) on the other hand cause a decrease in the interaction between the PDZ domain of Erbin and ERBB2.

[0132] The invention also relates to a process for identification of mutants of the ERBB2 receptor which inhibit or which increase the interaction with Erbin, in which a mutation of the ERBB2 receptor is effected and the effect of this mutation on the interaction of this receptor with Erbin is evaluated. The sequence of the human ERBB2 receptor is accessible for example on the EMBL database (access number X03363).

[0133] A double-hybrid test in yeast can be particularly appropriate for this purpose.

[0134] Thus mutations have been introduced in the last 9 carboxy-terminal amino acids of ERBB2 (peptide EYLGLDVPV) (SEQ ID no. 21). The changing of one of the amino acids underlined in the peptide, for example, into alanine, inhibits the interaction with the PDZ domain of Erbin.

[0135] The mutants of ERBB2 capable of being identified by this process also form part of the invention.

[0136] The authors of the present invention have more particularly determined that the mutants of ERBB2 mutated on the positions 5(L) and 6 (D) of SEQ ID no. 21 are incapable of binding Erbin but remain capable of binding other PDZ domains of other proteins, such as PICK1.

[0137] These mutants are in particular useful for discriminating molecules which inhibit the interaction of the PDZ domain of Erbin in screening processes.

[0138] These mutants of Erbin or of the ERBB2 receptor can be used in tests for screening other compounds which inhibit or activate the interaction between Erbin and the ERBB2 receptor. They can also be exploited within the framework of a therapy by administration of these mutated forms or of nucleic acids coding for these mutated forms.

[0139] In particular, the forms which activate the Erbin-ERBB2 interaction can be exploited as a medication which is useful in an individual who expresses an altered Erbin protein or a mutated ERBB2 receptor and in whom the Erbin-receptor interaction is reduced relative to a normal control.

[0140] Conversely, the forms which inhibit the Erbin-ERBB2 interaction can be useful as medication in the case for example of an overexpression of the receptor.

FIGURES

[0141]FIG. 1 displays a Western Blot showing the specific interaction of the PDZ domain of Erbin with ERBB2 but not with EGF-R. The bonded proteins transferred onto nitrocellulose have been revealed by anti-ERBB2 and anti-EGF-R antibodies. One tenth of the lysate was loaded on gel (total lysate TL). The GST alone or the GST-X11α PDZ domain (X11) were used as control. The GST-Erbin polypeptides used are: GST-Erbin (914-1371) lane 1, GST-Erbin (914-1240) lane 2 and PDZ domain of GST-Erbin lane 3.

[0142]FIG. 2 shows a transfer onto a nitrocellulose membrane of a SDS-PAGE gel in which has been loaded a fusion protein formed by the last 9 amino acids of the receptors related to EGF-R fused to the GST protein. The PDZ domain of the soluble GST-Erbin labeled at ³²P or the fusion proteins GST-LIN-7 were used in order to probe the membrane. After incubation, the membranes were washed and the binded proteins were revealed by autoradiography.

[0143]FIG. 3 shows a Western Blot of total lysates of different murine tissues and cell lines subjected to electrophoresis on SDS-PAGE, transfer onto nitrocellulose and contacted with a purified anti-Erbin antibody. The Erbin is a 180 kD protein indicated by the arrows.

EXAMPLES Example 1

[0144] Cloning of Erbin, a Gene Coding for a Protein with PDZ Domain Interacting with the ERBB2 Receptor.

[0145] The ERBB receptors receive their ligands, coming from the stroma, on the basolateral epithelium where they are localised (Klapper et al, 2000). The analysis of the carboxy-terminal peptide sequence of the ERBB receptors has revealed that the ERBB2 receptor contained a site for bonding to a PDZ domain preserved among humans, mice, quails and dogs (Songyang et al, 1997).

[0146] The authors of the invention then screened a library of mouse kidney cDNA using the two-hybrids system with the last 9 amino acids of ERBB2 fused with the binding domain to the GAL4 DNA (“GAL-BD”).

[0147] Two-Hybrids Procedure

[0148] The last nine amino acids of ERBB2 were fused to the sub-unit GAL4-BD using the vector pc97 which carries LEU2. A bank of embryonal mouse kidney cDNA primed with an oligo-dT sequence, cloned in a vector pc86 which carries the marker Trp1 as selection marker was screened using the bait GAL4-BD-ERBB2 and the strain of yeast Y190 according to the protocol with lithium-acetate. Approximately 10⁶ transformants TRP*LEU* were selected on agar plates containing a medium deficient in tryptophan, leucine and histidine during primary screening and contain 10 mM of 3-aminotriazole (3AT). The β-galactosidase activity was tested by transferring the yeasts onto filters during the secondary screening.

[0149] After recovery, the DNA of the selected clones was retransformed in a yeast Y190 containing GAL4-BD-ERBB2 or GAL4-BD fused to control peptides.

[0150] In an alternative manner, the last 15 amino acids of ERBB2, of mutated ERBB2 (mutant VA: mutant in which the valine residue of the C-terminal end of ERBB2, a crucial residue for the interaction at the PDZ domain, has been mutated to alanine) and ERBB4 were fused to the protein LexA using the vector pBTM116 which carries Trp1. The PDZ domain of Erbin was fused to the region GAL4-AD coded by the vector pACT2 which carries Leu. Interactions were achieved in the strain of yeast of L40.

[0151] Procedures Relating to the Proteins

[0152] The cells were washed twice with a cold phosphate buffer solution PBS and lysed in a lysis buffer (50 mM HEPES pH 7.5, 10% glycerol, 150 mM NaCl, 1% triton X-100, 1.5 mM MgCl₂, 1 mM EGTA) complemented with 1 mM of PMSF (phenylmethylsulphonyl fluoride), 10 μg/ml of aprotinine and 10 μg/ml of leupeptine. Sodium orthovanadate with 200 μM final concentration was added to the lysis buffer when the cells had been stimulated with EGF. After centrifugation at 16000 g for 20 minutes, the protein content of the lysate was standardised utilising the protein dosage kit from Bio-Rad. For the immunoprecipitation, the lysates were incubated with antibodies for one night at 4° C. A-agarose protein was added and the immune complexes binded to the beads were recovered after one hour, washed three times in HNTG buffer (50 mM HEPES pH 7.5, 10% glycerol, 150 mM NaCl, 0.1% Triton X-100), boiled in test buffer IX and separated by SDS-PAGE. The transfer and the imunoblot on nitrocellulose using the chemicoluminescence method with the aid of the anti-rabbit HRP antibodies or anti-mouse HRP antibodies were carried out as described by Borg et al, 1996. For the tests known as “Far Western”, the membrane was incubated for two hours at ambient temperature with soluble GST fusion proteins marked with the protein kinase A and ³²PγATP diluted in skimmed milk TBS at 5%, 1 mM DTT (10⁶ cpm/ml). After rinsing in TBS-0.1% Triton X-100 buffer and TBS buffer, the bonded GST was revealed by autoradiography. The cellular transfection, the production of GST and the bonding dosages of GST were carried out as described previously (Borg et al, 1996).

[0153] The fractionated lysates were prepared as follows: the cells were lysed in hypotonic buffer (10 mM Tris-Cl [pH 7.4], 0.2 mM MgCl₂, 5 mM KCl) complemented with 1 mM of PMSF, 10 μg/ml of aprotonine and 10 μg/ml of leupeptine. The lysis was completed by 20 passages in a cold homogeniser (piston B). Saccharose was added to the homogenate, called the cytoplasmic fraction. The deposit was resuspended in a volume of lysis buffer (such as has been described above) equivalent to the volume of cytoplasmic fraction. After 30 minutes on ice, the insoluble debris was removed by centrifugation at 16000 g for 30 minutes at 4° C. The resulting supernatant was called the membranal fraction.

[0154] Results:

[0155] Thus the authors of the invention isolate a partial cDNA clone coding for the last 457 amino acids of a novel protein called Erbin for “ERBB2 interacting protein”. The clone drawn from the kidney bank of GAL-4-BD-ERBB2 contains the residues 914 to 1371 of murine Erbin and is called Erbin (914-1371). The proteins of fusion with GST are produced using the peptide sequences of Erbin (914-1371), Erbin (914-1240) and Erbin (1240-1371). The protein of fusion GST-Erbin (1240-1371) will be designated in the text as the PDZ domain of GST-Erbin.

[0156] The strain of yeast Y190 contransformed by vectors coding for the bait fused to Lex A and the prey fused to GAL4-BD were seeded on Trp-Leu-His medium containing 10 mM of 3AT. In the following table the + signs signify growth on the selective medium (-His) and positive β-galactosidase activity. pACT-Erbin 914-1371 1240-1371 LexA βgal −His Bgal −His ERBB2 + + + + ERBB2.VA − − − − ERBB4 − − − −

[0157] No interaction is observed with mutated ERBB2 on the C-terminal valine residue. The PDZ domain found at the C-terminal end of Erbin is sufficient to bond to ERBB2.

[0158] No bonding was observed with EPHB2, MUSK, PDGFRα and PDGFRβ. The proteins GST-Erbin (914-1371) and GST-Erbin PDZ bond to the chimera EGF-R/ERBB2 (HER 1/2) but not to EGF-R in “pull-down” tests of precipitation of proteins (FIG. 1). The domains GST-DLG, LIN-2 and X11α do not bond to HER 1/2. The direct binding of the PDZ domain of Erbin to the carboxy-terminal ends of the receptors ERBB was evaluated by “Far Western” test. The PDZ domain of the radiomarked Erbin bonds to the peptide ERBB2 but not to the peptide EGF-R, ERBB3 and ERBB4 and only bonds weakly to the peptide LET-23 (FIG. 2). The PDZ domain of LIN-7 bonds solely to LET-23. The PDZ domain of Erbin has a strong identity with the PDZ domain of DENSIN-180 (71%) and a 35% identity with the PDZ domains of class I (Songyang et al, 1997; Fanning et al, 1998) found in the proteins LIN-7 and PSD-95 according to the alignment of the sequences. The substitution of a His-Gly motif present in the αB motif of Erbin into a Tyr-Asp motif present in the PDZ domain of class III of NOS inhibits the interaction with ERBB2. These results show that the PDZ domain of Erbin interacts specifically and directly with ERBB2.

[0159] A cDNA of full length coding for human Erbin was cloned by RT-PCR from Daudi, a cellular line of human B lymphocytes, using the MARATHON kit, according to the recommendations of the manufacturer (Clontech). An open reading frame by stop codons in the three frames codes for a protein of 1371 amino acids. Erbin contains 16 LRR motifs in its amino-terminal end (residues 48-416) and one single C-terminal PDZ domain. A wide intermediate region of 863 amino acids between the LRR and PDZ domains contains extensions rich in proline which can represent binding sites or SH3 and WW domains.

Example 2

[0160] Characterisation of Erbin, Member of a New PDZ Family

[0161] A specific human Erbin probe has permitted detection of a transcript of 7.2 kb in the majority of human and murine tissues tested. This analysis was carried out by Northern Blot with the aid of the “Multiple Tissues” technique of Clontech, from 2 μg of mRNA poly(A)⁺ isolated from various human tissues. Specific antibodies against Erbin were prepared.

[0162] Preparation of the Antibodies

[0163] The monoclonal antibody anti-Mc 9E10 (Oncogene Research Products, Cambridge, Mass.) was used for the immunoprecipitation and the immunoblot. Monoclonal antibodies of rabbits anti-ERBB2 and anti-EGF-R were obtained by injection of peptides into rabbits. The monoclonal antibody 408 anti-EGF-R was used for the immunoprecipitation. The monoclonal antibody anti-EGF-R (clone LA22) of Upstate Biotechnology (UBI) was used for the immunofluorescence. The polyclonal antibodies anti-SHC and monoclonal antibody 4G10 mAb (anti-PY) were obtained from UBI. Anti-rabbit and anti-mouse goat IgG coupled to a horseradish peroxidase were purchased from Laboratoire Jackson and Dako respectively.

[0164] A polyclonal anti-Erbin rabbit antibody was produced by injecting a fusion protein GST-Erbin (914-1371). The anti-Erbin antibody was also purified by affinity with a histidine labeled Erbin (914-1371) binded to beads of agarose and of nickel (Qiagen).

[0165] These specific antibodies against Erbin detected a protein of 180 kD in the form of a doublet in the brain, the liver, the kidney, the spleen, the intestines and the skeletal muscles as well as in the epithelial cell lines Caco-2 and MDCK (FIG. 3). The expression of cDNA of Erbin of full length in the COS cells permitted the production of a protein of similar size to the protein detected by the antibody in question in the tissues and the cell extracts.

[0166] Cell Culture

[0167] The cells COS-1 and MDCK are cultivated in DMEM medium (Dulbecco modified Eagle medium) containing 100 U/ml of penicillin and 100 μg/ml of streptomycin sulphate, complemented with 10% foetal calf serum (FCS). The Caco-2 cells were kept in DMEM medium complemented with 20% of FCS and 1% of non-essential amino acids. All the cellular transfections were carried out using a reactant FIG. 6 according to the recommendations of the manufacturer (Boehringer Mannheim).

[0168] The structural homology between Erbin, DENSIN-180, VARTUL, KAA0147 and F26D11.11 led the authors of the invention to consider these proteins as members of a new PDZ family designated LAP for “LRR and PDZ domain proteins”.

Example 3

[0169] Interaction of Erbin with an Activated HER1/2 Receptor

[0170] The interaction in vivo between Erbin and the ERBB2 receptor was tested by co-immunoprecipitation in COS cells.

[0171] 1) HER1/2 was coexpressed transitorily with the polypeptides Erbin, SAP97 or PSD-95 fused to the epitope myc in COS-1 cells. After a lysis, HER1/2 was immunoprecipitated with the antibody anti-EGF-R (clone 408) and the binded proteins were revealed by anti-myc and anti-ERBB2 antibodies respectively. Only Erbin was associated with HER1/2.

[0172] 2) HER1/2 and EGF-R were coexpressed transitorily with Erbin in the COS-1 cells. After treatment on the EGF the cells were lysed and an aliquot fraction of a protein extract was deposited on SDS-PAGE, subjected to electrophoresis and transferred onto a nitrocellulose membrane. An equal quantity of Erbin and of receptor was found in the lysates. The receptors were immunoprecipitated with an anti-EGF-R antibody, the binded proteins were subjected to a Western Blot and revealed with anti-Erbin and anti-receptor antibodies. Only Her1/2 interacted with Erbin. As control, the protein p52 SHC was immunoprecipitated in an increasing manner with HER1/2 and EGF-R after stimulation by EGF.

[0173] 3) HER1/2 and EGF-R were expressed transitorily in the COS-1 cells. After 5 minutes of stimulation with medium containing no growth factors or containing 200 ng/ml EGF, the cells were lysed, and “pull-down” GST tests of precipitation of proteins were carried outusing the GST-SHC PTB or GST-ERBIN PDZ domains fixed on beads of agarose. the proteins precipitated were revealed by a Western Blot analysis using an anti-PY antibody (higher screens). After dehybridisation of the antibodies, the membranes were revealed with anti-ERBB2 and anti-EGF-R antibodies respectively (anti-RTK). Whist only the phosphorylated receptors binded to the SHC PTB domain, non phosphorylated HER1/2 interacted with the PDZ domain of Erbin.

[0174] 4) HER1/2 containing a mutation of the carboxy-terminal valine (mutant VA) or a “kinase-dead” HER1/2, that is to say an HER1/2 deprived of catalytic activity (mutant KA), were expressed in COS-1 cells and the interaction with GST fusion proteins was tested as described previously. The VA mutation in HER1/2 did not affect the phosphorylation of tyrosine residues to the receptor whilst the mutant KA was not phosphorylated on tyrosines. As expected, the mutant HER1/2 KA did not bind to the SHC PTB domain (FIG. 3d). On the other hand, the PDZ domain of Erbin precipitates HER1/2 KA in an effective manner but not via HER1/2 VA. Taken as a whole, these results show that the PDZ domain of Erbin interacts solely with the non-activated HER1/2 receptor.

[0175] The authors of the invention wondered whether the proteins behaved in an identical manner in vivo by coexpressing Erbin labeled by myc with HER1/2 in the COS cells. It was found that the pool of HER1/2 associated with Erbin was not phosphorylated at the site of the tyrosine after stimulation with EGF. On the other hand, the SHC proteins interacted with the phosphorylated receptor. Erbin is a substrate for the ERBB2 kinase and no phosphorylation of Erbin was found during the expression of the HER1/2 KA receptor. These data show that Erbin interacts with non-phosphorylated HER1/2 on tyrosine and identify Erbin as a substrate of the ERBB2 kinase. As a SHC PTB binding site was found at the C-terminal end of ERBB2, two residues upstream of the binding site to the PDZ domain (Borg et al, 1978), the recruitment of the SHC proteins at this site, after stimulation with EGF, could prevent the fixation of the PDZ domain of Erbin. The phosphorylation of the tyrosine in itself is not sufficient to alter the interaction of the HER1/2 PDZ domain since the peptide phospshorylated at the level of the tyrosine inhibits this interaction as effectively as a non-phosphorylated peptide. The interaction of the PDZ domain is therefore capable of being inhibited by competition with protein modules of the signalling machinery.

Example 4

[0176] The Basolateral Localisation of ERBB2 in the Epithelial Cells is Dependent upon the Site of Interaction with Erbin

[0177] In order to make a closer study of the sub-cellular localisation of ERBB2 and of Erbin, the authors of the invention used the Caco-2 line, a cell line of human colon carcinoma, expressing both proteins.

[0178] The fractionation of the Caco-2 cells showed that Erbin was principally present in the membrane fraction whilst the SHC proteins were predominant in the cytolosic fraction. Immunostaining was then carried out on the Caco-2 cells polarised with a purified anti-Erbin antibody. For that, the Caco-2 cells were seeded on glass strips, permeabilised and stained with the indicated primary antibodies, then subjected to staining with the secondary antibodies conjugated to different fluorochromes according to the following protocol.

[0179] Immunolocalisation and Labeling of the Cell Surfaces

[0180] For the immunostaining procedures, the MDCK cells were cultivated on glass strips for three days after confluence and were labeled as described in Le Bivic et al, 1989, with a monoclonal antibody against EGF-R and a polyclonal antibody against gp114 (apical marker of the MDCK cells) (Le Bivic et al, 1990). The Caco-2 cells were cultivated on glass strips for 10 days after confluence in order to ensure a complete differentiation, then were subjected to the same treatment as the MDCK cells. The antibody against Ag525 (basolateral marker of human intestinal cells) was described previously (Le Bivic et al, 1988). Frozen sections (0.5 to 1 μm) of human colon were obtained as described previously (Le Bivic et al, 1988).

[0181] For the biotinylation of the cell surfaces, the cells cultivated on filters for three days after confluence were labeled for one night with “³⁵S-promix ready vue” from Amersham (18.5 MBq/ml) on the basolateral face. The cells were labeled with Sulfo-NHS-LC-biotin (Pierce) with 90 minutes chase in normal medium either on the apical face or on the basolateral face and EGF-R or the chimeras were immunoprecipitated with streptavidine as described in Le Bivic et al, 1989. The specimens were analysed by SDS-PAGE and visualised by fluorography. The autoradiograms were quantified using Intelligent Quantifier from Bioimage (Ann Arbor, Mich.).

[0182] Erbin was found on the basolateral face of the plasmic membrane of Caco-2 since the colocalisation was obtained by co-staining with the monoclonal antibody 525, specific marker of the basolateral epithelium.

[0183] Moreover, ERBB2 was also basolateral and colocalised with Erbin. The staining of section of human colon with the anti-Erbin antibody confirmed the basolateral localisation of the protein in these tissues.

[0184] The authors of the invention then produced MDCK cells expressing EGF-R, HER1/2 and HER1/2.VA in a stable manner. MDCK expressed low levels of endogenous EGF-R and ERBB2 receptors contrary to the cell populations transfected with the constructs, which permitted an expression of EGF-R and HER1/2 (initial and mutant).

[0185] Immunofluorescence experiments were carried out with LA22, which is a specific human monoclonal anti-EGF-R antibody. EGF-R and HER1/2 were localised on the basolateral face whilst the mutation of HER1/2 (HER1/2.VA) prevented the basolateral localisation of HER1/2. The biotinylation of the cell surface was carried out on the apical and basolateral faces of the transfectants MDCK and the quantities of receptors were quantified by autoradiography. Whilst EGF-R and HER1/2 were localised at 83% of the basolateral side, the mutant HER1/2.VA was only 53% basolateral. Thus the elimination of the site of interaction of Erbin led to a depeolarisation of ERBB2 in the epithelial cells.

Example 5

[0186] Production of a “Knock-Out” Mouse

[0187] The strategy consists of eliminating the first exon from Erbin gene coding for the first 63 amino acids of Erbin and replacing it by a cassette coding for the gene for neomycin resistance. The disappearance of the translation initiation codon and of the following 62 amino acids causes a loss of expression of the functional Erbin protein.

[0188] The homologous recombination vector of pPNT3 type substitutes sequences of the Erbin gene (exon 1) by a cassette of gene for neomycin resistance. It comprises:

[0189] the gene for resistance to neomycin (neo^(r)) provided with two recognised promoters in eukaryote and prokaryote cells.

[0190] the PGK-TK cassette is constituted by the minimal promoter of phosphoglycerate kinase (PGK) and sequences coding the thymidine kinase of the Herpes simplex virus. The PGK promoter is normally activated in the ES cells and permits a negative selection to be made in the course of the selection of the recombinant clones. This cassette is placed in 5′ of the regions of homology of the Erbin. The recombination vector is constructed from 6 kb of genomic sequence of Erbin: 1.5 kb upstream of the first exon (5′ arm) and 4.5 kb (3′ arm) downstream of the first exon. The final recombination vector has the following DNA sequences in series: 5′ Cassette PGK-TK—arm 5′ Erbin—neo^(r) gene—arm 3′ Erbin 3′. After linearisation of the recombination vector, this latter is transferred by electroporation into ES 129 cells (embryonic stem cells) and the transfected cells are selected by addition of geneticine into the culture medium, according to the protocol described in “Gene targeting: a practical approach”, Editions A. L. Joyner 1993 IRL Press. In order to demonstrate the clones of which the genome includes the event of homologous recombination, a counter-selection of ganciclovir is used. The clones are verified according to the protocol proposed in “Gene targeting: a practival approach”, Editions A. L. Joyner 1993 IRL Press, by PCR and Southern Blot, and Fiore et al, Int. J. Dev. Biol. 41: 639-642 (1997).

[0191] The selected clones are injected into C57BL/6 mouse blastocytes taken at 3.5 days of gestation. The blastocytes are reimplanted into pseudogestant SWISS mice of 2.5 days. The chimeras are demonstrated by the colour of their fur (black and brown for the chimeras). The chimeric males are crossed with C57BL/6 females and the tails of agouti newborns are taken and their genotype analysed by Southern Blot in order to detect the recombined allele. The crossing of heterozygotic mice amongst themselves makes it possible to obtain homozygotic mutants for the mutation (generation F2) (verification by Southern Blot).

Example 6

[0192] Mutagenesis of the PDZ Domain of Erbin

[0193] Punctual mutations effected in the PDZ domain of Erbin have inverse effects on the interaction with ERBB2/HER2.

[0194] In order to show the effect of these mutations, the following two techniques in particular can be used: precipitation of ERBB2 by recombinant fusion proteins and double-hybrid system in yeast.

[0195] Wild-type or mutated recombinant fusion proteins GST-PDZ ERBIN fixed to solid matrix are incubated with a cellular lysate containing ERBB2/HER2. After two hours, the beads are washed and the complexes resolved by SDS-PAGE. After transfer, ERBB2 is revealed by a specific antibody (Western Blot).

[0196] For the double-hybrid system in yeast, the bait is constituted by LEXA-BD fused to the ERBB2 peptide. The prey is constituted by GAL4-AD fused to the PDZ domain of wild-type or mutated Erbin.

[0197] These two tests gave identical results: mutants PDZ YD (of histidine 1347 into tyrosine and of glycine (position 1348) into aspartic acid) and NS (of threonine 1316 into asparagine and of arginine (position 1317) into serine) have less affinity for the ERBB2 receptor and mutants HL (of histidine (position 1347) into leucine) and MI (of leucine 1291 into methionine and of phenylalanine (position 1293) into isoleucine) have more affinity for ERBB2.

BIBLIOGRAPHICAL REFERENCES

[0198] Antonarakis S. E., Diagnosis of genetic disorders at the DNA level. N Engl J. Med. 320: 153-163 (1989).

[0199] Bellis et al, medecine/sciences, 13:1317-24 (1997).

[0200] Benoit et al, PNAS USA, 79, 917-921 (1982).

[0201] Borg et al (1996), The Phosphotyrosine Interaction Domains of X11 and FE65 bind to distinct sites on the YENPTY motif of amyloid precursor protein. Molecular and Cellular Biology 16: 6229-6241.

[0202] Borg et al (1998), Function of PTB domains. Curr. Top. Microbiol. Immunol. 228:23-38.

[0203] Borg et al (1998), Identification of an evolutionarily conserved heterotrimeric protein complex involved in protein targeting. J. Biol. Chem. 273(48): 31633-31636.

[0204] Cooper et al, Diagnosis of genetic disease using recombinant DNA, 3^(rd) Edition, Hum Genet., 87: 519-560 (1991).

[0205] Fanning, A. S. & Anderson, J. M. PDZ domains and the formation of protein networks at the plasma membrane. Curr. Top. Microbiol. Immunol. 228: 209-233 (1998).

[0206] Huber, C. G. et al, Rapid and accurate sizing of DNA fragments by ion-pair chromatography on alkylated nonporous poly(styrenedivinylbenzene) particles. Anal. Chem. 67, 578-585 (1995).

[0207] Klapper et al (2000), Biochemical and clinical implications of the ERBB/HER signaling network of growth factor receptors. Adv Cancer Res 77: 25-79.

[0208] Köhler and Milstein, Nature, 256, 495-497 (1975).

[0209] Kuklin, A. et al, Detection of single-nucleotide polymorphisms with the WAVE™ DNA fragment analysis system. Genetic Testing 1, 201-206 (1997/98).

[0210] Le Bivic et al (1988) Characterization of a glycoprotein expressed on the basolateral membrane of human intestinal epithelial cells and cultured colonic cell lines. Eur J Cell Biol 46: 113-120.

[0211] Le Bivic et al (1989) Vectorial targeting of apical and basolateral plasma membrane proteins in a human adenocarcinoma epithelial cell line. Proc Natl Acad Sci USA 86: 9313-9317.

[0212] Le Bivic et al (1990) Vectorial targeting of an endogenous apical membrane sialoglycoprotein and uvomorulin in MDCK cells. J Cell Biol 110: 1533-1539.

[0213] Ross et al (1999), HER-2/neu (c-erb-B2) gene and protein in breast cancer, Am. J. Clin. Pathol. 112: S53-67.

[0214] Sambrook et al, Molecular cloning, a laboratory manual, Spring Harbor Laboratory Press, 9.54-62 (1989).

[0215] Songyang et al (1997) Recognition of unique carboxyl-terminal motifs by distinct PDZ domains. Science 275: 73-77.

[0216] Zhang et al (1996) A yeast three-hybrid-method to clone ternary protein complex components. Analytical Biochemistry, 242, 68-72.

[0217]

1 21 1 6412 DNA Homo sapiens CDS (324)..(4436) 1 aaagatcttt tttttttttt tttctttttt tttcggcgga gatcctcgtt ggggctggga 60 aactcctgca aaactcgaga ccaggaagcc agcccgcacc ccaaccccca ccaaagccac 120 ctactcttct tctgtgggag gccagtccac atccgctctc acccgagaga gatattcagc 180 tggatccaaa gtgactgatg aagggaagga aatcatgtca agcgaagcct tgaaaaagct 240 gccctgagac ggtgtcccgc cgaaagaatg ttggctcaat taagaaacat cagggagata 300 aattcaaccc agtgtgtcta aaa atg act aca aaa cga agt ttg ttt gtg cgg 353 Met Thr Thr Lys Arg Ser Leu Phe Val Arg 1 5 10 ttg gta cca tgt cgc tgt cta cga ggg gaa gag gag act gtc act act 401 Leu Val Pro Cys Arg Cys Leu Arg Gly Glu Glu Glu Thr Val Thr Thr 15 20 25 ctt gat tat tct cat tgc agc tta gaa caa gtt ccg aaa gag att ttt 449 Leu Asp Tyr Ser His Cys Ser Leu Glu Gln Val Pro Lys Glu Ile Phe 30 35 40 act ttt gaa aaa acc ttg gag gaa ctc tat tta gat gct aat cag att 497 Thr Phe Glu Lys Thr Leu Glu Glu Leu Tyr Leu Asp Ala Asn Gln Ile 45 50 55 gaa gag ctt cca aag caa ctt ttt aac tgt cag tct tta cac aag ctg 545 Glu Glu Leu Pro Lys Gln Leu Phe Asn Cys Gln Ser Leu His Lys Leu 60 65 70 agt ttg cca gac aat gat tta aca acg tta cca gca tcc att gca aac 593 Ser Leu Pro Asp Asn Asp Leu Thr Thr Leu Pro Ala Ser Ile Ala Asn 75 80 85 90 ctt att aat ctc agg gaa ctg gat gtc agc aag aat gga ata cag gag 641 Leu Ile Asn Leu Arg Glu Leu Asp Val Ser Lys Asn Gly Ile Gln Glu 95 100 105 ttt cca gaa aat ata aaa aat tgt aaa gtt ttg aca att gtg gag gcc 689 Phe Pro Glu Asn Ile Lys Asn Cys Lys Val Leu Thr Ile Val Glu Ala 110 115 120 agt gta aac cct att tcc aag ctc cct gat gga ttt tct cag ctg tta 737 Ser Val Asn Pro Ile Ser Lys Leu Pro Asp Gly Phe Ser Gln Leu Leu 125 130 135 aac cta acc cag ttg tat ctg aat gat gct ttt ctt gag ttc ttg cca 785 Asn Leu Thr Gln Leu Tyr Leu Asn Asp Ala Phe Leu Glu Phe Leu Pro 140 145 150 gca aat ttt ggc aga tta act aaa ctc caa ata tta gag ctt aga gaa 833 Ala Asn Phe Gly Arg Leu Thr Lys Leu Gln Ile Leu Glu Leu Arg Glu 155 160 165 170 aac cag tta aaa atg ttg cct aaa act atg aat aga ctg acc cag ctg 881 Asn Gln Leu Lys Met Leu Pro Lys Thr Met Asn Arg Leu Thr Gln Leu 175 180 185 gaa aga ttg gat ttg gga agt aac gaa ttc acg gaa gtg cct gaa gta 929 Glu Arg Leu Asp Leu Gly Ser Asn Glu Phe Thr Glu Val Pro Glu Val 190 195 200 ctt gag caa cta agt gga ttg aaa gag ttt tgg atg gat gct aat aga 977 Leu Glu Gln Leu Ser Gly Leu Lys Glu Phe Trp Met Asp Ala Asn Arg 205 210 215 ctg act ttt att cca ggg ttt att ggt agt ttg aaa cag ctc aca tat 1025 Leu Thr Phe Ile Pro Gly Phe Ile Gly Ser Leu Lys Gln Leu Thr Tyr 220 225 230 ttg gat gtt tct aaa aat aat att gaa atg gtt gaa gaa gga att tca 1073 Leu Asp Val Ser Lys Asn Asn Ile Glu Met Val Glu Glu Gly Ile Ser 235 240 245 250 aca tgt gaa aac ctt caa gac ctc cta tta tca agc aat tca ctt cag 1121 Thr Cys Glu Asn Leu Gln Asp Leu Leu Leu Ser Ser Asn Ser Leu Gln 255 260 265 cag ctt cct gag act att ggt tcg ttg aag aat ata aca acg ctt aaa 1169 Gln Leu Pro Glu Thr Ile Gly Ser Leu Lys Asn Ile Thr Thr Leu Lys 270 275 280 ata gat gaa aac cag tta atg tat ctg cca gac tct ata gga ggg tta 1217 Ile Asp Glu Asn Gln Leu Met Tyr Leu Pro Asp Ser Ile Gly Gly Leu 285 290 295 ata tca gta gaa gaa ctg gat tgt agt ttc aat gag gtt gaa gct ttg 1265 Ile Ser Val Glu Glu Leu Asp Cys Ser Phe Asn Glu Val Glu Ala Leu 300 305 310 cct tca tct att ggg cag ctt act aac tta aga act ttt gct gct gat 1313 Pro Ser Ser Ile Gly Gln Leu Thr Asn Leu Arg Thr Phe Ala Ala Asp 315 320 325 330 cat aat tac tta cag cag ttg ccc cca gag att gga agc tgg aaa aat 1361 His Asn Tyr Leu Gln Gln Leu Pro Pro Glu Ile Gly Ser Trp Lys Asn 335 340 345 ata act gtg ctg ttt ctc cat tcc aat aaa ctt gag aca ctt cca gag 1409 Ile Thr Val Leu Phe Leu His Ser Asn Lys Leu Glu Thr Leu Pro Glu 350 355 360 gaa atg ggt gat atg caa aaa tta aaa gtc att aat tta agt gat aat 1457 Glu Met Gly Asp Met Gln Lys Leu Lys Val Ile Asn Leu Ser Asp Asn 365 370 375 aga tta aag aat tta ccc ttt agc ttt aca aag cta cag caa ttg aca 1505 Arg Leu Lys Asn Leu Pro Phe Ser Phe Thr Lys Leu Gln Gln Leu Thr 380 385 390 gct atg tgg ctc tca gat aat cag tcc aaa ccc ctg ata cct ctt caa 1553 Ala Met Trp Leu Ser Asp Asn Gln Ser Lys Pro Leu Ile Pro Leu Gln 395 400 405 410 aaa gaa act gat tca gag acc cag aaa atg gtg ctt acc aac tac atg 1601 Lys Glu Thr Asp Ser Glu Thr Gln Lys Met Val Leu Thr Asn Tyr Met 415 420 425 ttc cct caa cag cca agg act gag gat gtt atg ttt ata tca gat aat 1649 Phe Pro Gln Gln Pro Arg Thr Glu Asp Val Met Phe Ile Ser Asp Asn 430 435 440 gaa agt ttt aac cct tca ttg tgg gag gaa cag agg aaa cag cgg gct 1697 Glu Ser Phe Asn Pro Ser Leu Trp Glu Glu Gln Arg Lys Gln Arg Ala 445 450 455 caa gtt gca ttt gaa tgt gat gaa gac aaa gat gaa agg gag gca cct 1745 Gln Val Ala Phe Glu Cys Asp Glu Asp Lys Asp Glu Arg Glu Ala Pro 460 465 470 ccc agg gag gga aat tta aaa aga tat cca aca cca tac cca gat gag 1793 Pro Arg Glu Gly Asn Leu Lys Arg Tyr Pro Thr Pro Tyr Pro Asp Glu 475 480 485 490 ctt aag aat atg gtc aaa act gtt caa acc att gta cat aga tta aaa 1841 Leu Lys Asn Met Val Lys Thr Val Gln Thr Ile Val His Arg Leu Lys 495 500 505 gat gaa gag acc aat gaa gac tca gga aga gat ttg aaa cca cat gaa 1889 Asp Glu Glu Thr Asn Glu Asp Ser Gly Arg Asp Leu Lys Pro His Glu 510 515 520 gat caa caa gat ata aat aaa gat gtg ggt gtg aag acc tca gaa agt 1937 Asp Gln Gln Asp Ile Asn Lys Asp Val Gly Val Lys Thr Ser Glu Ser 525 530 535 act act aca gta aaa agc aaa gtt gat gaa aga gaa aaa tat atg ata 1985 Thr Thr Thr Val Lys Ser Lys Val Asp Glu Arg Glu Lys Tyr Met Ile 540 545 550 gga aac tct gta cag aag atc agt gaa cct gaa gct gag att agt cct 2033 Gly Asn Ser Val Gln Lys Ile Ser Glu Pro Glu Ala Glu Ile Ser Pro 555 560 565 570 ggg agt tta cca gtg act gca aat atg aaa gcc tct gag aac ttg aag 2081 Gly Ser Leu Pro Val Thr Ala Asn Met Lys Ala Ser Glu Asn Leu Lys 575 580 585 cat att gtt aac cat gat gat gtt ttt gag gaa tct gaa gaa ctt tct 2129 His Ile Val Asn His Asp Asp Val Phe Glu Glu Ser Glu Glu Leu Ser 590 595 600 tct gat gaa gag atg aaa atg gcg gag atg cga cca cca tta att gaa 2177 Ser Asp Glu Glu Met Lys Met Ala Glu Met Arg Pro Pro Leu Ile Glu 605 610 615 acc tct att aac cag cca aaa gtc gta gca ctt agt aat aac aaa aaa 2225 Thr Ser Ile Asn Gln Pro Lys Val Val Ala Leu Ser Asn Asn Lys Lys 620 625 630 gat gat aca aag gaa aca gat tct tta tca gat gaa gtt aca cac aat 2273 Asp Asp Thr Lys Glu Thr Asp Ser Leu Ser Asp Glu Val Thr His Asn 635 640 645 650 agc aat cag aat aac agc aat tgt tct tct cca tct cgg atg tct gat 2321 Ser Asn Gln Asn Asn Ser Asn Cys Ser Ser Pro Ser Arg Met Ser Asp 655 660 665 tca gtt tct ctt aat act gat agt agt caa gac acc tca ctc tgc tct 2369 Ser Val Ser Leu Asn Thr Asp Ser Ser Gln Asp Thr Ser Leu Cys Ser 670 675 680 cca gtg aaa caa act cat att gat att aat tcc aaa atc agg caa gaa 2417 Pro Val Lys Gln Thr His Ile Asp Ile Asn Ser Lys Ile Arg Gln Glu 685 690 695 gat gaa aat ttt aac agc ctt tta caa aat gga gat att tta aac agt 2465 Asp Glu Asn Phe Asn Ser Leu Leu Gln Asn Gly Asp Ile Leu Asn Ser 700 705 710 tca aca gag gaa aag ttc aaa gct cat gat aaa aaa gat ttt aac tta 2513 Ser Thr Glu Glu Lys Phe Lys Ala His Asp Lys Lys Asp Phe Asn Leu 715 720 725 730 cct gaa tat gat ttg aat gtt gaa gag cga tta gtt cta att gag aaa 2561 Pro Glu Tyr Asp Leu Asn Val Glu Glu Arg Leu Val Leu Ile Glu Lys 735 740 745 agt gtt gac tca aca gcc aca gct gat gac act cac aaa tta gat cat 2609 Ser Val Asp Ser Thr Ala Thr Ala Asp Asp Thr His Lys Leu Asp His 750 755 760 atc aat atg aat ctt aat aaa ctt ata act aat gat aca ttt caa cca 2657 Ile Asn Met Asn Leu Asn Lys Leu Ile Thr Asn Asp Thr Phe Gln Pro 765 770 775 gag atc atg gaa aga tca aaa aca cag gat att gtg ctt gga aca agc 2705 Glu Ile Met Glu Arg Ser Lys Thr Gln Asp Ile Val Leu Gly Thr Ser 780 785 790 ttt tta agc att aat tct aaa gag gaa act gag cac ttg gaa aat gga 2753 Phe Leu Ser Ile Asn Ser Lys Glu Glu Thr Glu His Leu Glu Asn Gly 795 800 805 810 aac aag tat cct aat ttg gaa tcc gta aat aag gta aat gga cat tct 2801 Asn Lys Tyr Pro Asn Leu Glu Ser Val Asn Lys Val Asn Gly His Ser 815 820 825 gag gaa act tcc cag tct cct aat agg act gaa cca cat gac agt gat 2849 Glu Glu Thr Ser Gln Ser Pro Asn Arg Thr Glu Pro His Asp Ser Asp 830 835 840 tgt tct gtt gac tta ggt att tcc aaa agc act gaa gat ctc tcc cct 2897 Cys Ser Val Asp Leu Gly Ile Ser Lys Ser Thr Glu Asp Leu Ser Pro 845 850 855 cag aaa agt ggt cca gtt gga tct gtt gtg aaa tct cat agc ata act 2945 Gln Lys Ser Gly Pro Val Gly Ser Val Val Lys Ser His Ser Ile Thr 860 865 870 aat atg gag att gga ggg cta aaa atc tat gat att ctt agt gat aat 2993 Asn Met Glu Ile Gly Gly Leu Lys Ile Tyr Asp Ile Leu Ser Asp Asn 875 880 885 890 gga cct cag cag cca agt aca acc gtt aaa atc aca tct gct gtt gat 3041 Gly Pro Gln Gln Pro Ser Thr Thr Val Lys Ile Thr Ser Ala Val Asp 895 900 905 gga aaa aat ata gtc agg agc aag tct gcc aca ctg ttg tat gat caa 3089 Gly Lys Asn Ile Val Arg Ser Lys Ser Ala Thr Leu Leu Tyr Asp Gln 910 915 920 cca ttg cag gta ttt act ggt tct tcc tca tct tct gat tta ata tca 3137 Pro Leu Gln Val Phe Thr Gly Ser Ser Ser Ser Ser Asp Leu Ile Ser 925 930 935 gga aca aag gca att ttc aag ttt gat tca aat cat aat ccc gaa gag 3185 Gly Thr Lys Ala Ile Phe Lys Phe Asp Ser Asn His Asn Pro Glu Glu 940 945 950 cca aat ata ata aga ggc ccc aca agt ggc cca caa tct gca cct caa 3233 Pro Asn Ile Ile Arg Gly Pro Thr Ser Gly Pro Gln Ser Ala Pro Gln 955 960 965 970 ata tat ggt cct cca cag tat aat atc caa tac agt agc agt gct gca 3281 Ile Tyr Gly Pro Pro Gln Tyr Asn Ile Gln Tyr Ser Ser Ser Ala Ala 975 980 985 gtc aaa gac act ttg tgg cac tcc aaa caa aat ccc caa ata gac cat 3329 Val Lys Asp Thr Leu Trp His Ser Lys Gln Asn Pro Gln Ile Asp His 990 995 1000 gcc agt ttt cct cct cag ctc ctt cct aga tca gag agc aca gaa aat 3377 Ala Ser Phe Pro Pro Gln Leu Leu Pro Arg Ser Glu Ser Thr Glu Asn 1005 1010 1015 caa agt tat gct aaa cat tct gcc aat atg aat ttc tct aat cat aac 3425 Gln Ser Tyr Ala Lys His Ser Ala Asn Met Asn Phe Ser Asn His Asn 1020 1025 1030 aat gtt cga gct aat act gca tac cat tta cat cag aga ctt ggc cca 3473 Asn Val Arg Ala Asn Thr Ala Tyr His Leu His Gln Arg Leu Gly Pro 1035 1040 1045 1050 gca aga cat ggg gaa atg tgg gcc atc tca cca aac gac cga ctt att 3521 Ala Arg His Gly Glu Met Trp Ala Ile Ser Pro Asn Asp Arg Leu Ile 1055 1060 1065 cct gca gta act cga agt aca atc cag cga caa agt agt gtg tcc tcc 3569 Pro Ala Val Thr Arg Ser Thr Ile Gln Arg Gln Ser Ser Val Ser Ser 1070 1075 1080 aca gcc tct gta aat ctt ggt gat cca ggc tct aca agg cgg gct cag 3617 Thr Ala Ser Val Asn Leu Gly Asp Pro Gly Ser Thr Arg Arg Ala Gln 1085 1090 1095 att cct gaa gga gat tat tta tca tac aga gag ttc cac tca gcg gga 3665 Ile Pro Glu Gly Asp Tyr Leu Ser Tyr Arg Glu Phe His Ser Ala Gly 1100 1105 1110 aga act cct cca atg atg cca gga tca cag aga ccc ctt tct gca cga 3713 Arg Thr Pro Pro Met Met Pro Gly Ser Gln Arg Pro Leu Ser Ala Arg 1115 1120 1125 1130 aca tac agc ata gat ggt cca aat gca tca aga cct cag agt gct cga 3761 Thr Tyr Ser Ile Asp Gly Pro Asn Ala Ser Arg Pro Gln Ser Ala Arg 1135 1140 1145 ccc tct att aat gaa ata cca gag aga act atg tca gtt agt gat ttc 3809 Pro Ser Ile Asn Glu Ile Pro Glu Arg Thr Met Ser Val Ser Asp Phe 1150 1155 1160 aat tat tca cgg act agt cct tca aaa aga cca aat gca agg gtt ggt 3857 Asn Tyr Ser Arg Thr Ser Pro Ser Lys Arg Pro Asn Ala Arg Val Gly 1165 1170 1175 tct gag cat tct tta tta gat cct cca gga aaa agt aaa gtt cct cgt 3905 Ser Glu His Ser Leu Leu Asp Pro Pro Gly Lys Ser Lys Val Pro Arg 1180 1185 1190 gac tgg aga gaa caa gta ctt cga cat att gaa gcc aaa aag tta gaa 3953 Asp Trp Arg Glu Gln Val Leu Arg His Ile Glu Ala Lys Lys Leu Glu 1195 1200 1205 1210 aag atg cct ttg agt aat gga cag atg ggc cag cct ctc agg cct cag 4001 Lys Met Pro Leu Ser Asn Gly Gln Met Gly Gln Pro Leu Arg Pro Gln 1215 1220 1225 gca aat tat agt caa ata cat cac ccc cct cag gca tct gtg gca agg 4049 Ala Asn Tyr Ser Gln Ile His His Pro Pro Gln Ala Ser Val Ala Arg 1230 1235 1240 cat ccc tct aga gaa caa cta att gat tac ttg atg ctg aaa gtg gcc 4097 His Pro Ser Arg Glu Gln Leu Ile Asp Tyr Leu Met Leu Lys Val Ala 1245 1250 1255 cac cag cct cca tat aca cag ccc cat tgt tct cct aga caa ggc cat 4145 His Gln Pro Pro Tyr Thr Gln Pro His Cys Ser Pro Arg Gln Gly His 1260 1265 1270 gaa ctg gca aaa caa gag att cga gtg agg gtt gaa aag gat cca gaa 4193 Glu Leu Ala Lys Gln Glu Ile Arg Val Arg Val Glu Lys Asp Pro Glu 1275 1280 1285 1290 ctt gga ttt agc ata tca ggt ggt gtc ggg ggt aga gga aac cca ttc 4241 Leu Gly Phe Ser Ile Ser Gly Gly Val Gly Gly Arg Gly Asn Pro Phe 1295 1300 1305 aga cct gat gat gat ggt ata ttt gta aca agg gta caa cct gaa gga 4289 Arg Pro Asp Asp Asp Gly Ile Phe Val Thr Arg Val Gln Pro Glu Gly 1310 1315 1320 cca gca tca aaa tta ctg cag cca ggt gat aaa att att cag gct aat 4337 Pro Ala Ser Lys Leu Leu Gln Pro Gly Asp Lys Ile Ile Gln Ala Asn 1325 1330 1335 ggc tac agt ttt ata aat att gaa cat gga caa gca gtg tcc ttg cta 4385 Gly Tyr Ser Phe Ile Asn Ile Glu His Gly Gln Ala Val Ser Leu Leu 1340 1345 1350 aaa act ttc cag aat aca gtt gaa ctc atc att gta cga gaa gtt tcc 4433 Lys Thr Phe Gln Asn Thr Val Glu Leu Ile Ile Val Arg Glu Val Ser 1355 1360 1365 1370 tca taagcactgt ggacaaaaaa agcggggaag acagcaagat ttattggaag 4486 Ser atacttacag gggaaattaa tattttgact atttttatat ataaagaaga actcaaaaaa 4546 ttatgttcaa atttgtacat taatgaaata atggataaag gagactgttg aattcatacc 4606 atataaaact tgttaggttt ttaaacatag caatcaaggc tacaaaaaca aacctgtgtt 4666 gtttttgtat agattgtagg tttatttttg gatttcatat acatgactga actgtgtgca 4726 aggcaatagt tagccttgat tttagcccag agacagatgg cagagctatc tctctcatag 4786 cttttatgcc cttattttta ttcaactggt attaatgttt ttctcctgaa actacttttt 4846 ttgatgtggg caagagattt gaagtgttgg cttttgctat gtgcatattg aattgaagag 4906 tgagtaggtg aaggtggtgc tggtgggttc actttccaag gccagactaa aacagttatt 4966 ttctataaaa atctggaagc aaagaatggg gatggggaga gctacgtggt agtatgtttt 5026 tattaggaga ataatgcaat aaaatatgta atgtcttttt tataaagcaa aaaagacaat 5086 aattgcattt atgagctcgg caggatctgt tcttgtcata gccattgact atacatttgc 5146 tactggtgat tcagttttta attttttagt cacaggaaat ttttaactct actgtagatg 5206 catgtccatg cattttctgt gttatggaaa tccactgatt tttttttttt tttcaaatgg 5266 tggtacttgc aatctgtttt ataattagtg ctccatttaa atctaattta taatttttat 5326 tttaagcagc aaatgaaaca aaaatggcca gttttaagat tgtgttgcct gtaacacaaa 5386 atgttacgaa ggtttaggaa agcctctttg atttttgttt ggccttgcat tggcccttgg 5446 taaagtaaaa ggaaacagta cacttggagc taggaaacca aagcaagctt tgtgaaactg 5506 gcacagtgat agagaattgc tgtggagagt tatagagcaa agggatgggt ccttgaggcc 5566 tgccagtgtg taaaggtgtt caaataaagg gctgtttcta caggtaacat taaatgtgaa 5626 ctcaacactt ccagagtctt taaagggttt ctatgtgtat cagtgtaata gtgttttacc 5686 accaactgcc tttctttgtt cctagttact gtaacaaata tttgatgata gaggtttatt 5746 aattttgttt atccagacca ttaattttat ttgtttttgt ctatgtaatc aaataaaatt 5806 tgagtaacat gtaatggtaa ggattaatgc atggttattt ggaccagaaa aaagtgccat 5866 agaagaccaa taactgttta gttgaggcta gtctggaacc tttcattaga gcaatatttg 5926 gttattgcac ttcattttta tttactaaga aatgcaattt gggaattttt aatctgttat 5986 gctttgttta tcaaccttga ttttaattaa gacttttata agactagctt aaaacaccaa 6046 ccaacattat ttttgcaaaa gtgagttgga ctcactttcc attcttgcta gtcagagtaa 6106 gtaggcagca cttttaaaaa tatgtgaact caaatattgc acttctttca agatgttatc 6166 aattggttat tgtactgtat agttttaata attttgattg aaacccttta acaactcttt 6226 gtaaatttta actcatttta gttgattttc agtactattt acataggaat tgatttttat 6286 ggatatagta gaagaaatgt gctgtatttt gataaaattc acttattgta tgtgtgttgt 6346 aatctaaaaa aaaaaagaat gacaaacagc ttctttaaga caaaaaaaaa aaaaaaaaaa 6406 aaatat 6412 2 1371 PRT Homo sapiens 2 Met Thr Thr Lys Arg Ser Leu Phe Val Arg Leu Val Pro Cys Arg Cys 1 5 10 15 Leu Arg Gly Glu Glu Glu Thr Val Thr Thr Leu Asp Tyr Ser His Cys 20 25 30 Ser Leu Glu Gln Val Pro Lys Glu Ile Phe Thr Phe Glu Lys Thr Leu 35 40 45 Glu Glu Leu Tyr Leu Asp Ala Asn Gln Ile Glu Glu Leu Pro Lys Gln 50 55 60 Leu Phe Asn Cys Gln Ser Leu His Lys Leu Ser Leu Pro Asp Asn Asp 65 70 75 80 Leu Thr Thr Leu Pro Ala Ser Ile Ala Asn Leu Ile Asn Leu Arg Glu 85 90 95 Leu Asp Val Ser Lys Asn Gly Ile Gln Glu Phe Pro Glu Asn Ile Lys 100 105 110 Asn Cys Lys Val Leu Thr Ile Val Glu Ala Ser Val Asn Pro Ile Ser 115 120 125 Lys Leu Pro Asp Gly Phe Ser Gln Leu Leu Asn Leu Thr Gln Leu Tyr 130 135 140 Leu Asn Asp Ala Phe Leu Glu Phe Leu Pro Ala Asn Phe Gly Arg Leu 145 150 155 160 Thr Lys Leu Gln Ile Leu Glu Leu Arg Glu Asn Gln Leu Lys Met Leu 165 170 175 Pro Lys Thr Met Asn Arg Leu Thr Gln Leu Glu Arg Leu Asp Leu Gly 180 185 190 Ser Asn Glu Phe Thr Glu Val Pro Glu Val Leu Glu Gln Leu Ser Gly 195 200 205 Leu Lys Glu Phe Trp Met Asp Ala Asn Arg Leu Thr Phe Ile Pro Gly 210 215 220 Phe Ile Gly Ser Leu Lys Gln Leu Thr Tyr Leu Asp Val Ser Lys Asn 225 230 235 240 Asn Ile Glu Met Val Glu Glu Gly Ile Ser Thr Cys Glu Asn Leu Gln 245 250 255 Asp Leu Leu Leu Ser Ser Asn Ser Leu Gln Gln Leu Pro Glu Thr Ile 260 265 270 Gly Ser Leu Lys Asn Ile Thr Thr Leu Lys Ile Asp Glu Asn Gln Leu 275 280 285 Met Tyr Leu Pro Asp Ser Ile Gly Gly Leu Ile Ser Val Glu Glu Leu 290 295 300 Asp Cys Ser Phe Asn Glu Val Glu Ala Leu Pro Ser Ser Ile Gly Gln 305 310 315 320 Leu Thr Asn Leu Arg Thr Phe Ala Ala Asp His Asn Tyr Leu Gln Gln 325 330 335 Leu Pro Pro Glu Ile Gly Ser Trp Lys Asn Ile Thr Val Leu Phe Leu 340 345 350 His Ser Asn Lys Leu Glu Thr Leu Pro Glu Glu Met Gly Asp Met Gln 355 360 365 Lys Leu Lys Val Ile Asn Leu Ser Asp Asn Arg Leu Lys Asn Leu Pro 370 375 380 Phe Ser Phe Thr Lys Leu Gln Gln Leu Thr Ala Met Trp Leu Ser Asp 385 390 395 400 Asn Gln Ser Lys Pro Leu Ile Pro Leu Gln Lys Glu Thr Asp Ser Glu 405 410 415 Thr Gln Lys Met Val Leu Thr Asn Tyr Met Phe Pro Gln Gln Pro Arg 420 425 430 Thr Glu Asp Val Met Phe Ile Ser Asp Asn Glu Ser Phe Asn Pro Ser 435 440 445 Leu Trp Glu Glu Gln Arg Lys Gln Arg Ala Gln Val Ala Phe Glu Cys 450 455 460 Asp Glu Asp Lys Asp Glu Arg Glu Ala Pro Pro Arg Glu Gly Asn Leu 465 470 475 480 Lys Arg Tyr Pro Thr Pro Tyr Pro Asp Glu Leu Lys Asn Met Val Lys 485 490 495 Thr Val Gln Thr Ile Val His Arg Leu Lys Asp Glu Glu Thr Asn Glu 500 505 510 Asp Ser Gly Arg Asp Leu Lys Pro His Glu Asp Gln Gln Asp Ile Asn 515 520 525 Lys Asp Val Gly Val Lys Thr Ser Glu Ser Thr Thr Thr Val Lys Ser 530 535 540 Lys Val Asp Glu Arg Glu Lys Tyr Met Ile Gly Asn Ser Val Gln Lys 545 550 555 560 Ile Ser Glu Pro Glu Ala Glu Ile Ser Pro Gly Ser Leu Pro Val Thr 565 570 575 Ala Asn Met Lys Ala Ser Glu Asn Leu Lys His Ile Val Asn His Asp 580 585 590 Asp Val Phe Glu Glu Ser Glu Glu Leu Ser Ser Asp Glu Glu Met Lys 595 600 605 Met Ala Glu Met Arg Pro Pro Leu Ile Glu Thr Ser Ile Asn Gln Pro 610 615 620 Lys Val Val Ala Leu Ser Asn Asn Lys Lys Asp Asp Thr Lys Glu Thr 625 630 635 640 Asp Ser Leu Ser Asp Glu Val Thr His Asn Ser Asn Gln Asn Asn Ser 645 650 655 Asn Cys Ser Ser Pro Ser Arg Met Ser Asp Ser Val Ser Leu Asn Thr 660 665 670 Asp Ser Ser Gln Asp Thr Ser Leu Cys Ser Pro Val Lys Gln Thr His 675 680 685 Ile Asp Ile Asn Ser Lys Ile Arg Gln Glu Asp Glu Asn Phe Asn Ser 690 695 700 Leu Leu Gln Asn Gly Asp Ile Leu Asn Ser Ser Thr Glu Glu Lys Phe 705 710 715 720 Lys Ala His Asp Lys Lys Asp Phe Asn Leu Pro Glu Tyr Asp Leu Asn 725 730 735 Val Glu Glu Arg Leu Val Leu Ile Glu Lys Ser Val Asp Ser Thr Ala 740 745 750 Thr Ala Asp Asp Thr His Lys Leu Asp His Ile Asn Met Asn Leu Asn 755 760 765 Lys Leu Ile Thr Asn Asp Thr Phe Gln Pro Glu Ile Met Glu Arg Ser 770 775 780 Lys Thr Gln Asp Ile Val Leu Gly Thr Ser Phe Leu Ser Ile Asn Ser 785 790 795 800 Lys Glu Glu Thr Glu His Leu Glu Asn Gly Asn Lys Tyr Pro Asn Leu 805 810 815 Glu Ser Val Asn Lys Val Asn Gly His Ser Glu Glu Thr Ser Gln Ser 820 825 830 Pro Asn Arg Thr Glu Pro His Asp Ser Asp Cys Ser Val Asp Leu Gly 835 840 845 Ile Ser Lys Ser Thr Glu Asp Leu Ser Pro Gln Lys Ser Gly Pro Val 850 855 860 Gly Ser Val Val Lys Ser His Ser Ile Thr Asn Met Glu Ile Gly Gly 865 870 875 880 Leu Lys Ile Tyr Asp Ile Leu Ser Asp Asn Gly Pro Gln Gln Pro Ser 885 890 895 Thr Thr Val Lys Ile Thr Ser Ala Val Asp Gly Lys Asn Ile Val Arg 900 905 910 Ser Lys Ser Ala Thr Leu Leu Tyr Asp Gln Pro Leu Gln Val Phe Thr 915 920 925 Gly Ser Ser Ser Ser Ser Asp Leu Ile Ser Gly Thr Lys Ala Ile Phe 930 935 940 Lys Phe Asp Ser Asn His Asn Pro Glu Glu Pro Asn Ile Ile Arg Gly 945 950 955 960 Pro Thr Ser Gly Pro Gln Ser Ala Pro Gln Ile Tyr Gly Pro Pro Gln 965 970 975 Tyr Asn Ile Gln Tyr Ser Ser Ser Ala Ala Val Lys Asp Thr Leu Trp 980 985 990 His Ser Lys Gln Asn Pro Gln Ile Asp His Ala Ser Phe Pro Pro Gln 995 1000 1005 Leu Leu Pro Arg Ser Glu Ser Thr Glu Asn Gln Ser Tyr Ala Lys His 1010 1015 1020 Ser Ala Asn Met Asn Phe Ser Asn His Asn Asn Val Arg Ala Asn Thr 1025 1030 1035 1040 Ala Tyr His Leu His Gln Arg Leu Gly Pro Ala Arg His Gly Glu Met 1045 1050 1055 Trp Ala Ile Ser Pro Asn Asp Arg Leu Ile Pro Ala Val Thr Arg Ser 1060 1065 1070 Thr Ile Gln Arg Gln Ser Ser Val Ser Ser Thr Ala Ser Val Asn Leu 1075 1080 1085 Gly Asp Pro Gly Ser Thr Arg Arg Ala Gln Ile Pro Glu Gly Asp Tyr 1090 1095 1100 Leu Ser Tyr Arg Glu Phe His Ser Ala Gly Arg Thr Pro Pro Met Met 1105 1110 1115 1120 Pro Gly Ser Gln Arg Pro Leu Ser Ala Arg Thr Tyr Ser Ile Asp Gly 1125 1130 1135 Pro Asn Ala Ser Arg Pro Gln Ser Ala Arg Pro Ser Ile Asn Glu Ile 1140 1145 1150 Pro Glu Arg Thr Met Ser Val Ser Asp Phe Asn Tyr Ser Arg Thr Ser 1155 1160 1165 Pro Ser Lys Arg Pro Asn Ala Arg Val Gly Ser Glu His Ser Leu Leu 1170 1175 1180 Asp Pro Pro Gly Lys Ser Lys Val Pro Arg Asp Trp Arg Glu Gln Val 1185 1190 1195 1200 Leu Arg His Ile Glu Ala Lys Lys Leu Glu Lys Met Pro Leu Ser Asn 1205 1210 1215 Gly Gln Met Gly Gln Pro Leu Arg Pro Gln Ala Asn Tyr Ser Gln Ile 1220 1225 1230 His His Pro Pro Gln Ala Ser Val Ala Arg His Pro Ser Arg Glu Gln 1235 1240 1245 Leu Ile Asp Tyr Leu Met Leu Lys Val Ala His Gln Pro Pro Tyr Thr 1250 1255 1260 Gln Pro His Cys Ser Pro Arg Gln Gly His Glu Leu Ala Lys Gln Glu 1265 1270 1275 1280 Ile Arg Val Arg Val Glu Lys Asp Pro Glu Leu Gly Phe Ser Ile Ser 1285 1290 1295 Gly Gly Val Gly Gly Arg Gly Asn Pro Phe Arg Pro Asp Asp Asp Gly 1300 1305 1310 Ile Phe Val Thr Arg Val Gln Pro Glu Gly Pro Ala Ser Lys Leu Leu 1315 1320 1325 Gln Pro Gly Asp Lys Ile Ile Gln Ala Asn Gly Tyr Ser Phe Ile Asn 1330 1335 1340 Ile Glu His Gly Gln Ala Val Ser Leu Leu Lys Thr Phe Gln Asn Thr 1345 1350 1355 1360 Val Glu Leu Ile Ile Val Arg Glu Val Ser Ser 1365 1370 3 1647 DNA Mus musculus CDS (1)..(1485) 3 gcg tcc ggc aag tct gcc aca ctg ctg tat gat cag ccc ctg cag gtg 48 Ala Ser Gly Lys Ser Ala Thr Leu Leu Tyr Asp Gln Pro Leu Gln Val 1 5 10 15 ttc act gct gcc tcc tca tct tct gag tta ctg tca ggt aca aag gca 96 Phe Thr Ala Ala Ser Ser Ser Ser Glu Leu Leu Ser Gly Thr Lys Ala 20 25 30 gtt ttc aag ttt gat tca aat cac aac cct gaa gag cca gat ata ata 144 Val Phe Lys Phe Asp Ser Asn His Asn Pro Glu Glu Pro Asp Ile Ile 35 40 45 aga gct gcc aca gta tct ggc cca cag tct aca cct cac ctg tat ggt 192 Arg Ala Ala Thr Val Ser Gly Pro Gln Ser Thr Pro His Leu Tyr Gly 50 55 60 ccc cca caa tat aat gtc cag tac agt ggc agt gct aca gtg aaa gac 240 Pro Pro Gln Tyr Asn Val Gln Tyr Ser Gly Ser Ala Thr Val Lys Asp 65 70 75 80 acc ttg tgg cac cct aaa cag aat cca caa ata gat cct gtc agt ttt 288 Thr Leu Trp His Pro Lys Gln Asn Pro Gln Ile Asp Pro Val Ser Phe 85 90 95 cct cct cag cgc ctt cca aga tct gag agt gca gaa aat cac agt tat 336 Pro Pro Gln Arg Leu Pro Arg Ser Glu Ser Ala Glu Asn His Ser Tyr 100 105 110 gca aag cat tct gcc aac atg aat ttc tct aac cat aac aat gtt aga 384 Ala Lys His Ser Ala Asn Met Asn Phe Ser Asn His Asn Asn Val Arg 115 120 125 gcc aat act gga tac cac tta caa cag aga ctt gcc cca gcc agg cat 432 Ala Asn Thr Gly Tyr His Leu Gln Gln Arg Leu Ala Pro Ala Arg His 130 135 140 gga gaa atg tgg gct atc tcc ccg aat gac cga cta gtg cca gca gtc 480 Gly Glu Met Trp Ala Ile Ser Pro Asn Asp Arg Leu Val Pro Ala Val 145 150 155 160 act cgg act act att cag aga cag agt agc gtg tct tca acg gct tct 528 Thr Arg Thr Thr Ile Gln Arg Gln Ser Ser Val Ser Ser Thr Ala Ser 165 170 175 gta aat ctc ggt gac ccc aca agg cgt act gaa gga gat tac tta tcg 576 Val Asn Leu Gly Asp Pro Thr Arg Arg Thr Glu Gly Asp Tyr Leu Ser 180 185 190 tac aga gag tta cat tca atg gga aga act cca gtc atg tca gga tca 624 Tyr Arg Glu Leu His Ser Met Gly Arg Thr Pro Val Met Ser Gly Ser 195 200 205 cag aga cct ctt tct gca cga gcg tac agc atc gat ggc cca aat aca 672 Gln Arg Pro Leu Ser Ala Arg Ala Tyr Ser Ile Asp Gly Pro Asn Thr 210 215 220 tcc agg cct cag agt gcc cgt ccc tct att aat gaa ata cca gag aga 720 Ser Arg Pro Gln Ser Ala Arg Pro Ser Ile Asn Glu Ile Pro Glu Arg 225 230 235 240 act atg tca gtt agt gat ttc aat tac tca cgg act agt cct tca aaa 768 Thr Met Ser Val Ser Asp Phe Asn Tyr Ser Arg Thr Ser Pro Ser Lys 245 250 255 aga cca aat aca agg gtc ggg tct gaa cat tct ctg tta gat cct cca 816 Arg Pro Asn Thr Arg Val Gly Ser Glu His Ser Leu Leu Asp Pro Pro 260 265 270 gga aaa agc aag gtt cct cat gac tgg cgg gaa caa gta cta cga cac 864 Gly Lys Ser Lys Val Pro His Asp Trp Arg Glu Gln Val Leu Arg His 275 280 285 att gag gcc aaa aag tta gaa aag cat ccg cag aca tcc agt cca gga 912 Ile Glu Ala Lys Lys Leu Glu Lys His Pro Gln Thr Ser Ser Pro Gly 290 295 300 gag tgt tgc caa gat gat aga ttc atg tca gaa gag cag aac cac cca 960 Glu Cys Cys Gln Asp Asp Arg Phe Met Ser Glu Glu Gln Asn His Pro 305 310 315 320 tca ggt gca ctt agt cac aga ggt ctt cca gac agc ctg atg aaa atg 1008 Ser Gly Ala Leu Ser His Arg Gly Leu Pro Asp Ser Leu Met Lys Met 325 330 335 cct ttg agt aat gga caa atg ggc cag cct ctc agg cct cag gca cat 1056 Pro Leu Ser Asn Gly Gln Met Gly Gln Pro Leu Arg Pro Gln Ala His 340 345 350 tat agt caa acc cat cac ccc cct cag gca tct gtg gca agg cat ccc 1104 Tyr Ser Gln Thr His His Pro Pro Gln Ala Ser Val Ala Arg His Pro 355 360 365 tct aga gaa caa cta att gat tat ttg atg ctg aaa gtg gcc cac cag 1152 Ser Arg Glu Gln Leu Ile Asp Tyr Leu Met Leu Lys Val Ala His Gln 370 375 380 cct cca tat aca cat ccc cat tgt tct cct aga caa ggc cat gaa ctg 1200 Pro Pro Tyr Thr His Pro His Cys Ser Pro Arg Gln Gly His Glu Leu 385 390 395 400 gca aag caa gag att cga gta aga gta gaa aag gat cca gaa ctt gga 1248 Ala Lys Gln Glu Ile Arg Val Arg Val Glu Lys Asp Pro Glu Leu Gly 405 410 415 ttt agc ata tca gga ggt gtc ggc ggc aga gga aac cct ttc aga cct 1296 Phe Ser Ile Ser Gly Gly Val Gly Gly Arg Gly Asn Pro Phe Arg Pro 420 425 430 gat gat gat ggt ata ttt gta aca agg gta caa cct gaa gga cca gca 1344 Asp Asp Asp Gly Ile Phe Val Thr Arg Val Gln Pro Glu Gly Pro Ala 435 440 445 tca aaa tta cta cag cca gga gat aaa att att cag gct aat ggc tac 1392 Ser Lys Leu Leu Gln Pro Gly Asp Lys Ile Ile Gln Ala Asn Gly Tyr 450 455 460 agt ttt atc aac att gaa cat ggg caa gcg gtg tcc ttg tta aaa act 1440 Ser Phe Ile Asn Ile Glu His Gly Gln Ala Val Ser Leu Leu Lys Thr 465 470 475 480 ttc cat aat gca gta gat ctc atc att gta cga gaa gtt tct tca 1485 Phe His Asn Ala Val Asp Leu Ile Ile Val Arg Glu Val Ser Ser 485 490 495 taagcactat agacaaaaac aatgggggaa gatagcaaga tttatggaag acacttgcaa 1545 gggaaatgac tattttgctt atttttatat ataaagaaga actcaactgt tggtttgaaa 1605 cttgtacatt actgaaataa tggaccttgt ggttagaggg aa 1647 4 495 PRT Mus musculus 4 Ala Ser Gly Lys Ser Ala Thr Leu Leu Tyr Asp Gln Pro Leu Gln Val 1 5 10 15 Phe Thr Ala Ala Ser Ser Ser Ser Glu Leu Leu Ser Gly Thr Lys Ala 20 25 30 Val Phe Lys Phe Asp Ser Asn His Asn Pro Glu Glu Pro Asp Ile Ile 35 40 45 Arg Ala Ala Thr Val Ser Gly Pro Gln Ser Thr Pro His Leu Tyr Gly 50 55 60 Pro Pro Gln Tyr Asn Val Gln Tyr Ser Gly Ser Ala Thr Val Lys Asp 65 70 75 80 Thr Leu Trp His Pro Lys Gln Asn Pro Gln Ile Asp Pro Val Ser Phe 85 90 95 Pro Pro Gln Arg Leu Pro Arg Ser Glu Ser Ala Glu Asn His Ser Tyr 100 105 110 Ala Lys His Ser Ala Asn Met Asn Phe Ser Asn His Asn Asn Val Arg 115 120 125 Ala Asn Thr Gly Tyr His Leu Gln Gln Arg Leu Ala Pro Ala Arg His 130 135 140 Gly Glu Met Trp Ala Ile Ser Pro Asn Asp Arg Leu Val Pro Ala Val 145 150 155 160 Thr Arg Thr Thr Ile Gln Arg Gln Ser Ser Val Ser Ser Thr Ala Ser 165 170 175 Val Asn Leu Gly Asp Pro Thr Arg Arg Thr Glu Gly Asp Tyr Leu Ser 180 185 190 Tyr Arg Glu Leu His Ser Met Gly Arg Thr Pro Val Met Ser Gly Ser 195 200 205 Gln Arg Pro Leu Ser Ala Arg Ala Tyr Ser Ile Asp Gly Pro Asn Thr 210 215 220 Ser Arg Pro Gln Ser Ala Arg Pro Ser Ile Asn Glu Ile Pro Glu Arg 225 230 235 240 Thr Met Ser Val Ser Asp Phe Asn Tyr Ser Arg Thr Ser Pro Ser Lys 245 250 255 Arg Pro Asn Thr Arg Val Gly Ser Glu His Ser Leu Leu Asp Pro Pro 260 265 270 Gly Lys Ser Lys Val Pro His Asp Trp Arg Glu Gln Val Leu Arg His 275 280 285 Ile Glu Ala Lys Lys Leu Glu Lys His Pro Gln Thr Ser Ser Pro Gly 290 295 300 Glu Cys Cys Gln Asp Asp Arg Phe Met Ser Glu Glu Gln Asn His Pro 305 310 315 320 Ser Gly Ala Leu Ser His Arg Gly Leu Pro Asp Ser Leu Met Lys Met 325 330 335 Pro Leu Ser Asn Gly Gln Met Gly Gln Pro Leu Arg Pro Gln Ala His 340 345 350 Tyr Ser Gln Thr His His Pro Pro Gln Ala Ser Val Ala Arg His Pro 355 360 365 Ser Arg Glu Gln Leu Ile Asp Tyr Leu Met Leu Lys Val Ala His Gln 370 375 380 Pro Pro Tyr Thr His Pro His Cys Ser Pro Arg Gln Gly His Glu Leu 385 390 395 400 Ala Lys Gln Glu Ile Arg Val Arg Val Glu Lys Asp Pro Glu Leu Gly 405 410 415 Phe Ser Ile Ser Gly Gly Val Gly Gly Arg Gly Asn Pro Phe Arg Pro 420 425 430 Asp Asp Asp Gly Ile Phe Val Thr Arg Val Gln Pro Glu Gly Pro Ala 435 440 445 Ser Lys Leu Leu Gln Pro Gly Asp Lys Ile Ile Gln Ala Asn Gly Tyr 450 455 460 Ser Phe Ile Asn Ile Glu His Gly Gln Ala Val Ser Leu Leu Lys Thr 465 470 475 480 Phe His Asn Ala Val Asp Leu Ile Ile Val Arg Glu Val Ser Ser 485 490 495 5 3996 DNA Mus musculus 5 gcggccgcgg taggatcagg cttgtagtcc taacttccga ggctgaggca ggaggctcac 60 ttgcctccag gaggccagcc gaggctacat gagaccttgt ctcaaacaaa caaacaaaca 120 aacaaacaga gtaagtaaag aaaaatctga atttagtctg acatttgaat tactcttttt 180 gtagcctttg tccacaaaag ttttgagtgt gcatgctatg tggcttccta gaacaaatca 240 catatttaat actccttttc tctattaatt gtcctgttat aattttgtat ccatttgctc 300 tgtgaagaca aattattaga aactttcaat ttttcaatat catatccaaa atacagtgtc 360 tttcagatta catacattag gtaataacat cccataccat tcattatatt tctaaatcta 420 gaatggtttc tgggaaatgc attttttaaa aagccatatc atccaggttc ttatacaccg 480 tcactgaaca cttatttgtt cagacgtgtt tgttaatgta tggtatgtga gaatcgtagt 540 ttgaatcact gaagtaattc tatgcaattg aactttctat aatgatggga atattatgta 600 atgtttgagc cactttggta tccttagtag ggaataagga aatgacattt taaattttat 660 ttgatttaac cttaatatta agcatgtata tctactggct accataccgg acatcaggta 720 gcaggttatc ttattagtat acaacgataa gcttaatccc tttttatcag ttacattcct 780 tattgaatga attgtttatt cctggtggta aaatgtgcat ttttataaac attaagtttt 840 tataaacaca aagttttcct ctaaaatctt tatctagaaa taagcataca aaaaaagaga 900 atagaattct tactctgtgt gtgtgatcgg attctaaccc aggacaagcc atacatggtc 960 atgttgtcat tgcctcagat gaaaacatcg ggtgtttcac ttactagatt tttaaaaaat 1020 ctttaaatag tgtataaata gacctattga tactaggttt tgaggatcta gttcaaatgt 1080 atattatatg ctttttcaaa aatctgtttt atgagtctgt gtagttctaa aaaagcaact 1140 cttaagtacc atgttaaaga tctttatttt tgactgttgt ttgtagttaa gaaaatctgg 1200 ttctctatgg aagcagtagg atattctcta aaaaattgta atgaagataa agaaaatatt 1260 tataaacatt atccaaagag ctacagcata gtgcctgatg attgagtaaa gtcagtgtag 1320 aaacagttgt actgggttgg ctgaagtatc gttaagtggg aagcagagcc tttagtgcca 1380 cattgtatcc tgaggtaagg aaaaaccaat gttggttgta tacactttac ttaaacattc 1440 gcattttgtt cataaatgat tattctttta agtaattaga gtatttaaag tattgccttg 1500 ctgatttttt gttactttct tatattttat gcctgaggtc ttactgattt aactctggct 1560 taatgctgtt taaaagtatt gtttctgtac ctaaggcaag aattatattc atgtacaaag 1620 aattactaac caattggtga tcattctcat tttcttaagt aatttcactg ctagtttact 1680 cttagcctag ttagaattta tacacaactt cttaataaaa ctggccttgt gaatttttat 1740 taattcaaag ttggctttct ctccctgtta actgagagac tgtattttaa gtgagtttat 1800 gcatatataa tcttaaacaa cttagaacct ataagctagt attagattaa aacaaaatta 1860 tacttggagt agatgggact tttcatgtag gagaatcata agaaagttaa atgtggttga 1920 tctctgatga acggttgaga agacctgaaa atgtctgcac agtcatttcc tattcagaaa 1980 tgcacagtca ggttcttgac taagagacag aactcttagt tggaaattag aggaaaacct 2040 tttactgttt ttaacagcta agcagacttc acaacgcatg gttttataaa aacgtaggag 2100 tattttgttt tgtgttgtat tcttggattg ttttgtaaag taccagagag cctgagttta 2160 atttgaagtc cagtagactg tcaactgttg atattaatag gctcagttgg gtgtatgtgg 2220 tggtggaggt attgtttgtc tgtggcatgt ggaattgtgc tgtatggacg gtcatttgtt 2280 ggaaaagttg ttgttatatt atatgcattc tgatttgttt ttatgttcat gctgcagcat 2340 ttaaaaatga ctacaaaacg aagtttgttt gtgcggttgg taccatgtcg ctgtctacga 2400 ggagaagagg agacagtcac tactctagat tattctcatt gcagcttgga acaagttccc 2460 aaagagattt tcacctttga aaaaacttta gaggagctct atttagatgc taatcagatt 2520 gaagagcttc caaaggtatg ttaatactat caagaattat attaggtaat gtctatgcat 2580 agcagtttat attcttatat tgttgtatat atgtgttcct gtttattggt ttagtcttac 2640 tgtaagcaca attcaaaatg atgtagaact ttgttcaagt tgtttaagta cttgtgtgga 2700 taattgaatt ttatttttac ttataacaat taaaatagat aatcagtata tatgacttct 2760 ggaaccatta gatttaacca gactaagaaa agtgaactct tactgtaatt tctataagca 2820 ctttatatgc attgctaaca ttgttacttc cattgctaca gcttttatgg ttctgtaact 2880 tcttagaggt gggattctgc tgttttccat tttgtatgga attgatttgg agacagtgca 2940 tctctgaagt ctagtaacaa ctggtagggg acagattgct gctgtgacgt tctttctcag 3000 attgccactg tgtttattat tgttatttta gtaagctctt ttttctctta tctgtgtttg 3060 ttttcctgtg tatgaaaagg ttgcagtagc tttaaattgt ttattacagt ctgtatccct 3120 ctgaatggca gacagggtct taccttgtag actagactgt cctcaactct cagagatctg 3180 cctgcccttg ctttccaagg ctgtggttga aggtgtacac caccttgctc tttgaagctg 3240 taaatggaga cttgaaattg atggcttaaa aaacaagtta gactttggca tggtggtacc 3300 taaatgagaa agatttctag ttcaaggcca gtctgaccta cacagtgggt tctgtcatgg 3360 cctggatcat atatataatc agaccttggt ttcaaaacct cctgaccatg gtcacgaatg 3420 aacaatttaa tatgcattac cagaattctg aaaaataaac tgtaatttgg actatgagat 3480 ttagacgaca aaattataga atacttagaa tttctgatga aaaagaaatc gtgtaaatgt 3540 gcagtgtgta ggataaacta aaaaactgtt tatctgaaat tcacatttaa ttgggtgtct 3600 cgtaattaat ccagcaaccc taactgaaat gtaacaggta atatatcttg ttgcatatag 3660 actttattta atgaaatagt tgttttcaat atttatgttt gtgaacattc tgaatgacag 3720 tattcaaaag catttgaaag ccactgctgt cctaggtaat gaagttttac aaaactttat 3780 tgagacagga tctcactttg tgatctagac aaacttcaaa cttcattgca atcctcctgc 3840 ctcagcttcc cagatctggg gattataggc tgagccacta tgcctggttt attcttaata 3900 atgtggagaa gtctagaact tttctatttg tgcacagtaa aggtcttcag tttaacatgg 3960 atttgtgaat cacacctgat gcctgcgatt gtagtg 3996 6 144 DNA Homo sapiens 6 agcatgctgt caaggtcctt taattccaat tttactactg taagcagttt tcactgtggc 60 agctctaggg atctgcatgg cagccagggc agtcttgcct tgagtgttgc agacagaaga 120 ggttctggtg ggcacatttt tcga 144 7 48 PRT Homo sapiens 7 Ser Met Leu Ser Arg Ser Phe Asn Ser Asn Phe Thr Thr Val Ser Ser 1 5 10 15 Phe His Cys Gly Ser Ser Arg Asp Leu His Gly Ser Gln Gly Ser Leu 20 25 30 Ala Leu Ser Val Ala Asp Arg Arg Gly Ser Gly Gly His Ile Phe Arg 35 40 45 8 676 DNA Homo sapiens 8 ctcgagacca ggaagccagc ccgcacccca acccccacca aagccaccta ctcttcttct 60 gtgggaggcc agtccacatc cgctctcacc cgagagagat attcagctgg atccaaagtg 120 actgatgaag ggaaggaaat catgtcaagc gaagccttga aaaagctgcc ctgagacggt 180 gtcccgccga aagaatgttg gctcaattaa gaaacatcag ggagataaat tcaacccagt 240 gtgtctaaaa atgactacaa aacgaagttt gtttgtgcgg ttggtaccat gtcgctgtct 300 acgaggggaa gaggagactg tcactactct tgattattct cattgcagct tagaacaagt 360 tccgaaagag atttttactt ttgaaaaaac cttggaggaa ctctatttag atgctaatca 420 gattgaagag cttccaaagc aactttttaa ctgtcagtct ttacacaagc tgagtttgcc 480 agacaatgat ttaacaacgt taccagcatc cattgcaaac cttattaatc tcagggaact 540 ggatgtcagc aagaatggaa tacaggagtt tccagaaaat ataaaaaatt gtaaagtttt 600 gacaattgtg gaggccagtg taaaccctat ttccaagctc cctgatggat tttctcagct 660 gttaaaccta acccag 676 9 366 DNA Homo sapiens 9 ccattcagac ctgatgatga tggtatattt gtaacaaggg tacaacctga aggaccagca 60 tcaaaattac tgcagccagg tgataaaatt attcaggcta atggctacag ttttataaat 120 attgaacatg gacaagcagt gtccttgcta aaaactttcc agaatacagt tgaactcatc 180 attgtacgag aagtttcctc ataagcactg tggacaaaaa aagcggggaa gacagcaaga 240 tttattggaa gatacttaca ggggaaatta atattttgac tatttttata tataaagaag 300 aactcaaaaa attatgttca aatttgtaca ttaatgaaat aatggataaa ggagactgtt 360 gaattc 366 10 31 DNA Homo sapiens 10 gggaggccag tccacatccg ctctcacccg a 31 11 32 DNA Homo sapiens 11 attcagctgg atccaaagtg actgatgaag gg 32 12 20 DNA Homo sapiens 12 actacaaaac gaagtttgtg 20 13 22 DNA Homo sapiens 13 gtttcctcag aatgtccatt ta 22 14 29 DNA Homo sapiens 14 atgtggttca gtcctattag gagactggg 29 15 30 DNA Homo sapiens 15 cagatccaac tggaccactt ttctgagggg 30 16 44 DNA Homo sapiens 16 cagcagatgt gattttaacg gttgtacttg gctgctgagg tcca 44 17 31 DNA Homo sapiens 17 gaggccccac aagtggccca caatctgcac c 31 18 33 DNA Homo sapiens 18 cctcctcagc tccttcctag atcagagagc aca 33 19 29 DNA Homo sapiens 19 cttccaataa atcttgctgt cttccccgc 29 20 29 DNA Homo sapiens 20 atagtcaaaa tattaatttc ccctgtaag 29 21 9 PRT Artificial sequence Description of the artificial sequence C-terminal end of ERBB2/HER2 21 Glu Tyr Leu Gly Leu Asp Val Pro Val 1 5 

1. Isolated polypeptide, called erbin, comprising a sequence of amino acids chosen from amongst SEQ ID no. 2 or no. 4, the sequence SEQ ID no. 2 deleted from residues 1212 to 1280, or the sequence SEQ ID no. 2 with an insertion between the amino acids 1211 and 1212 of the fragment of sequence SEQ ID no.
 7. 2. Isolated nucleic acid, comprising a nucleotide sequence coding for a polypeptide as claimed in claim
 1. 3. Nucleic acid as claimed in claim 2 comprising the sequence chosen from amongst SEQ ID no. 1, SEQ ID no. 3, the sequence SEQ ID no. 1 with a deletion of the nucleotides 3957 to 4163, or the sequence SEQ ID no. 1 with an insertion between the nucleotides 3956 and 3957 of the fragment of sequence SEQ ID no.
 6. 4. Isolated nucleic acid comprising an isolated nucleic acid sequence chosen from amongst the sequence SEQ ID no. 5, or the sequence comprising the nucleotides 2347 to 2535 on SEQ ID no.
 5. 5. Isolated polypeptide, comprising a sequence chosen from amongst: a) the residues 1 to 501 of SEQ ID no. 2; the residues 296 to 334 of SEQ ID no. 4; b) the sequence SEQ ID no. 7; c) the sequence SEQ ID no. 2 with a mutation of the histidine 1347 into leucine; the sequence SEQ ID no. 2 with a mutation of the leucine 1291 into methionine and of the phenylalanine 1293 into isoleucine; the sequence SEQ ID no. 2 with a mutation of the histidine 1347 into tyrosine and of the glycine 1348 into aspartic acid; and the sequence SEQ ID no. 2 with a mutation of the threonine 1316 into asparagine and of the arginine 1317 into serine.
 6. Isolated polypeptide, consisting of a sequence chosen from amongst: the residues 1279 to 1371 of SEQ ID no. 2; the residues 502 to 1278 of SEQ ID no. 2 the residues 914 to 1371 of SEQ ID no. 2; and the residues 1212 to 1280 of SEQ ID no.
 2. 7. Isolated nucleic acid comprising a nucleotide sequence coding for the polypeptide of claim 5 or
 6. 8. Cloning and/or expression vector containing a nucleic acid as claimed in any one of claims 2, 3, 4 or
 7. 9. Host cell transfected by a vector as claimed in claim
 8. 10. Use of a nucleic acid as claimed in one of claims 2, 3, 4 or 7 for obtaining probes or triggers having at least 15 nucleotides, which hybridise specifically with the nucleic acid sequence of claims 2, 3, 4 or 7 or its complementary sequence in stringent hybridisation conditions.
 11. Nucleic probe of which the sequence is chosen from amongst the sequence ID no. 8 or SEQ ID no.
 9. 12. Nucleic trigger of which the sequence is chosen from amongst the sequence SEQ ID no. 10 to SEQ ID no.
 20. 13. Process for the production of a recombinant polypeptide, in which a vector containing a nucleic acid as claimed in claim 2, 3, 4 or 7 is transferred into a host cell, which is cultured in conditions permitting the expression of the polypeptide as claimed in claim 1 or 5, or of a polypeptide coded by a nucleic acid sequence such as is defined in claim
 4. 14. Process for diagnosis in vitro of a tumour or of a predisposition to develop a tumour, comprising the steps consisting of: a1) putting a biological specimen containing DNA or RNA into the presence of specific oligonucleotides permitting the amplification of all or part of the erbin gene or of its transcript, comprising a sequence such as is defined in one of claims 2, 3, 4 or 7; b1) amplifying the said DNA or RNA; c1) detecting the products of amplification; d1) comparing the products of amplification which are obtained with those obtained with a control specimen, and in this way detecting a possible anomaly in the erbin gene or in its transcript, indicating a predisposition to develop a tumour, or a2) putting a biological specimen containing mRNA, obtained by taking a sample of suspect cells from a patient, into the presence of specific oligonucleotides permitting the amplification of all or part of the transcript of the erbin gene, comprising a sequence such as is defined in claim 2, 3, 4 or 6; b2) amplifying the said transcript; c2) detecting and quantifying the products of amplification; a modification in the amount of erbin transcript with reference to the normal control indicating a tumour or a predisposition to develop a tumour.
 15. Antibody directed against the polypeptide as defined in claim 1, 5 or
 6. 16. Use of at least one antibody as claimed in claim 15 for the detection or the purification of a polypeptide such as is defined in claim 1 or 5 in a biological specimen.
 16. Method of diagnosis in vitro of a tumour or of a predisposition to develop a tumour, comprising placing at least one antibody directed against the polypeptide as defined in claim 1, 5 or 6 in contact with a biological specimen obtained from a sample of suspect cells taken from a patient, in conditions permitting the possible formation of specific immunological complexes between the polypeptide as defined in claim 1, 5 or 6 and the said antibody or antibodies and the detection and/or the quantification of the specific immunological complexes which may be formed.
 18. Kit comprising: at least one antibody as claimed in claim 15, possibly fixed on a support; means for revealing the formation of specific antigen/antibody complexes between the polypeptide of claim 1, 5 or 6 and the said antibody and/or means for quantification of these complexes.
 19. Pharmaceutical composition comprising a nucleic acid as claimed in one of claims 2, 3, 4 or 7, an anti-sense nucleic acid of the nucleic acid as claimed in one of claims 2, 3, 4 or 7, or a polypeptide as claimed in claim 1, 5 or 6, in association with a pharmaceutically acceptable vehicle.
 20. Use of a nucleic acid as claimed in claim 2, 3, 4 or 7 or of a polypeptide as claimed in claim 1, 5 or 6, for the manufacture of a medication intended for the treatment of tumours.
 21. Process for obtaining a non-human transgenic animal in which the gene coding for the polypeptide as defined in claim 1 is invalidated.
 22. Non-human transgenic animal capable of being obtained by the process of claim
 21. 23. Method of screening molecules inhibiting or activating the bonding between the polypeptide of claim 1 and the erbB2 receptor, comprising: placing a molecule to be tested in contact with (i) a first bonding partner which is the erbB2/HER2 receptor or a fragment thereof capable of fixing the polypeptide of claim 1, and (ii) a second bonding partner which is the polypeptide of claim 1 or a fragment thereof capable of fixing the erbB2/HER2 receptor, the said first and/or second bonding partner(s) being marked as needed in such a way as to be detected, evaluating the inhibition or activation of the interaction between the said bonding partners by the molecule to be tested.
 24. Process for identification of mutants of the polypeptide of claim 1 or mutants of erbB2/HER2 which increase or decrease the interaction between this protein and the erbB2/HER2 receptor, in which: either a mutation is effected on the polypeptide of claim 1, 5 or 6 and the effect of this mutation on the interaction of the said polypeptide with the erbB2/HER2 receptor is evaluated; or a mutation is effected on the erbB2/HER2 receptor and the effect of this mutation on the interaction of the said receptor with the polypeptide of claim 1, 5 or 6 is evaluated. 